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WHAT IS CLAIMED IS: 

1. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said mature 
form, provided that said variant differs in no more than 15% of the amino acid residues 
from the amino acid sequence of said mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18 and 20; and 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20, wherein one or more amino acid residues in 
said variant differs from the amino acid sequence of said mature form, provided that 
said variant differs in no more than 15% of amino acid residues from said amino acid 
sequence. 

2 The polypeptide of claim 1 , wherein said polypeptide comprises the amino acid 
sequence of a naturally-occurring allelic variant of an amino acid sequence selected from the 
group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino 
acid sequence that is the translation of a nucleic acid sequence differing by a single nucleotide 
from a nucleic acid sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 7, 9, 
11, 13, 15, 17 and 19. 

4. The polypeptide of claim 1 , wherein the amino acid sequence of said variant 
comprises a conservative amino acid substitution. 
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5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding 
a polypeptide comprising an amino acid sequence selected from the group consisting of; 

(a) a mature form of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20; ■ 

(b) a variant of a mature" form of an amino acid sequence selected from the group 
consisting of SEQ JD NOS:2, 4,6*8,10, 12* 14, 16, 18 and 20, wherein one or more 
amino acid residues in said variant dififtra -from the amino acid sequence of said mature 
form, provided that saidjvar^it diffei^|u no more than 15% of the amino acid residues 
from the amino acid sequence of said mature to^^^iCii^ : 

(c) an amino acid sequra^ of SEQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18 and 20; _ ^_- v ^- ; , _ 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8 and 20, wherein one ormpre amino acid residues in 
said variant differs ^ fom ? tJte. ac^^f^d ^^^^ that 
said variant differs in no more, than 15% of amino acid residues from said amino acid 
sequence; 

(e) a nucleic add fragment encodtog^atr^^ a ifelj^eptide <kSi{^Sing ■*a5S 
amino Trad ^ 8, 10, 
12, 14, 16, 18 tod 20, or i v^mt of ^ or more amino acid 
residues in said variant differs from the amino acid sequence, of said mature form, 
provide tl^^ from said 
amino acid sequence; and _ ^ - ^ ~ : 

(f) a nucleic acid molecule comprising the co^ of (a), (b), (c^.(d) or (e). : 

6. The nucleic acid molecule of claim 5 ? wherein the-nucleic acid molecule 
comprises the nucleotide sequence of anatui^y-occurring allelic nucleic acid variant. 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
encodes a polypeptide comprising the amino acid sequence of a naturally-occurring 
polypeptide variant ; . 



8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs 
by a single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 19. 
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9. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule 
comprises a nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1,3, 
5, 7, 9,11,13, 15, 17 and 19; 

(b) a nucleotide sequence differing by one or more nucleotides from a nucleotide 
sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 
19, provided that no more than 20% of the nucleotides differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b). 

10. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule 
hybridizes under stringent conditions to a nucleotide sequence chosen from the group 
consisting of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19, or a complement of said 
nucleotide sequence. 

1 1 . The nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence comprising a coding sequence differing by one or more 
nucleotide sequences from a coding sequence encoding said amino acid sequence, 
provided that no more than 20% of the nucleotides in the coding sequence in said first 
nucleotide sequence differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first polynucleotide; and 

(c) a nucleic acid fragment of (a) or (b). 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13 . The vector of claim 12, further comprising a promoter operably-linked to said 
nucleic acid molecule. 

14. A cell comprising the vector of claim 12. 

15. An antibody that binds immunospecifically to the polypeptide of claim 1 . 
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1 6. The antibody of claim 15, wherein said antibody is a monoclonal antibody. 

17. The antibody of claim 15, wherein the antibody is a humanized antibody. 

~ 1 8r A method for determinmg the presence br amouhtlof the polypeptide of claim 1 
m a sample, Ihem^c^c^^ - ^ „ . - . . ^ 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically to the 
polypeptide; and 

(c) detenhinii^thep^ 

thereby detennining the presence or amount of polypeptide in said sample. 

i";;hie <--D aiiaivsis results tor *;y?,?4 

^ 19.. . A irifltbq^J^ or amount of the nucleic acid molecule 

ofdainvSinasaeopl^ ^ 

(a) providing the sample; c : — : ^ ^ a 

(b) contacting the sample with a probe that binds to said nucleic acid molecule; and 

(c) determining the presence or amount of the probe bound to said nucleic acid molecule, 
thereby determining the presence --^'amdimf^fi^i^ldi& acktrrielecule in said sample. 

20. The melted of claim 19^Ira 
molecule is used as a marker for jceli pr tissue : type. 

; /_ 21. 7: -r the cell or tissue type is cancerous. 

. 22: - A method of identifying an agent that binds to a polypeptide of claim 1, the 
method comprising: 1 

(a) contacting said polypeptide with said agent; and 

(b) determining whether ^ said agenVbinds to said polypeptide. 

23! The method of claim 22 wherein the agent is a cellular receptor or a 
downstream effector. 

24. A method for identifying an agent that modulates the expression or activity of 
the polypeptide of claim 1, the method comprising: 

129 



WO 01/74897 



PCT/US01/10892 



(a) providing a cell expressing said polypeptide; 

(b) contacting the cell with said agent, and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent 
modulates expression or activity of said polypeptide. 

25. A method for modulating the activity of the polypeptide of claim 1 , the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a compound 
that binds to said polypeptide in an amount sufficient to modulate the activity of the 
polypeptide. 

26. A method of treating or preventing a AMFX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired the 
polypeptide of claim 1 in an amount sufficient to treat or prevent said AMFX-associated 
disorder in said subject. 

27. The method of claim 26 wherein the disorder is selected from the group 
consisting of disorders related to cell signal processing, cell adhesion or migration pathway 
modulation, chemoresistance, radiotherapy resistance, survival in trophic factor limited 
secondary tissue site microenvironments, connective tissue disorders, tissue remodeling, 
oncogenesis, cancer of the breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, 
bladder, kidney, brain, and soft-tissue, cellular transformation, developmental tissue 
remodeling, inflammation, blood clot formation and resorption, hematopoiesis, angiogenesis, 
multidrug resistance related to organic anion transporters, malignant disease progression, 
autocrine and paracrine regulation of cell growth, and cellular responses to external stimuli, 
and other diseases, disorders and conditions of the like. 

28. The method of claim 26 wherein the disorder is related to cell signal processing 
and metabolic pathway modulation. 

29. The method of claim 26, wherein said subject is a human. 
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30. A method of treating or preventing a AMFX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired the ^ 
nucleic acid of claim 5 in an amount sufficient to treat or prevent said AMFX-associated 
disorder in said subject 

3 1 . The method of claim 3 0 wherein the disorder is selected from die group 
consisting of consisting of disordered or ^ " 
migration pathway modulation, cKemOT^ in trophic 
factor limted se^nd^ tissue disorders, tissue 
remodeling, oncogenesis, cancer of the breast, ovary, cervix, prostate, endometrium, stomach, 
colon, lung, bladder, K^e^braioC ah& £<ji&i^ developmental 
tissue remodeling, inflammation, blood clot formation ami resorption, hematopoiesis, 
angiogenesis, multidrug resistance related to o^amVaruoif&anspdrterS; malignant disease 
progression, autocrine aghd £axac^ to 
external stimuli; and other diseases, disorders and cbi^ lite. * 



32. The method of claim 30 wherein the disorder is related to cell signal processing 
and metabolic pathway modiilalfeiiiic-r.-air^. :r, .-.MF\ us -y~II ~li ctb?x ^ H'FX -r^^~x am 



\ 



33. The method of clmm 30; ^ - ^ 

34. ^ 'Anffiflk^ said method 
comprising administering to a subject in which such treatment or prevention is desired the 
antibody of claim 1 fTin an amount siifficient to treat or prevent said AlvdFX-associated disorder 
in said subject" 

35. Themefot^of clmm^^ 

consisting of disorders related to celLsignal processing, c$U adhesion or migration, pathway 
modulation, chemoreastan<^, radiotherapy resistance, survival in. trophic factor limited 
secondary tissue site microenvironments, connective tissue disorders, tissue remodeling, 
oncogenesis, cancer of the breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, 
bladder, kidney, brain, and soft-tissue, cellular transformation, developmental tissue 
remodeling, inflammation, blood clot formation and resorption, hematopoiesis, angiogenesis, 
multidrug resistance related to organic anion transporters, malignant disease progression, 
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autocrine and paracrine regulation of cell growth, and cellular responses to external stimuli, 
and other diseases, disorders and conditions of the like. 

36. The method of claim 34 wherein the disorder is related to cell signal processing 
and metabolic pathway modulation. 

37. The method of claim 34, wherein the subject is a human. 

38. A pharmaceutical composition comprising^e polypeptide of claim 1 and a 
pharmaceutically-acceptable carrier. 

39. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 : 
and a pharmaceutically-acceptable carrier. 

40. A pharmaceutical composition comprising the antibody of claim 1 5 and a 
pharmaceutically-acceptable carrier. 

41 . A kit comprising in one or more containers, the pharmaceutical composition of 
claim 38. 

42. A kit comprising in one or more containers, the pharmaceutical composition of 
claim 39. 

43. A kit comprising in one or more containers, the pharmaceutical composition of 
claim 40. 

44. A method for determining the presence of or predisposition to a disease 
associated with altered levels of the polypeptide of claim 1 in a first mammalian subject, the 
method comprising: 

(a) measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to the amount of the 
polypeptide present in a control sample from a second mammalian subject known not 
to have, or not to be predisposed to, said disease; 
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wherein an alteration in the expression -level of the polypeptide in the first subject as 
compared to the control sample indicates the presence of or predisposition to said disease. 

45. The methodrof claim 44 wherein the predisposition is to cancers. 

46. A method for determining the presence of or predisposition to a disease 
Associated witfrlit^ a first mammalian 
subj(^then^^ 

(a) measffiti^lB^^diint of the nucleic acid in a sample from the first mammalian subject; 

(h) nuclei© aediyi thaswipl^of; step (a) to the amoimt of the 

-.: nuclek; acid iKeacnt-in a control sample a second mammalian subject known not 
fo hav^iiC* beT*r^^ ^ brin of bi^oo o u-*?. s -c 

- - wherein aaralte^ afcfcf in the 1 first sebject as compared to 

the control Sample indicate tKe p^ 

47. Them^ wherein^tiie ^ 

.48. Amethod of trea^ 
administering. to the m$umn^ alleviate the 

pathok>gieal state r wju^ a -Polypeptide bfving a j v a mino acid sequence at 

least 95% identical ^to a polypeptide comprising antaroino arid, sequence o£at least one of SEQ 
ED NOS:2, 4ij5i 8*;1Q, 12, 14^ 46, 18 and20i:or a biologically active fragment thereof. 

- 4SK ~ A method of treating a pathological state in a mammal, the method comprising 
admihi&ermg to the mammal the antibody of claim 1 5 in an amount sufficient to alleviate the 
pdhologictf ^tei : : : ; - z 
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The AMF3 nucleic acids and proteins are useful in potential diagnostic and therapeutic 
applications implicated in various diseases and disorders described below and/or other ' 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: Marfan syndrome, congenital contractual 

5 arachnodactyly, Marfen-like habitus, familial adenomatous polyposis and other diseases, 

disorders and conditions of me Kkel By way of nominating example, me compositions of the 
present mvetroon wiil have efficacy for treatment of patients suffering from Marfan syndrome, 
congenital cantractural arachnodactyly, Marfinvlike habitus, fan^ai adenoniatous poryp^ 
Additional AMF3-related 'd^aM'hidM^aki»^ai mentioned tiooi^^bdiiit'ute'^pecifiDtfioife^ 

10 Farmer^ i^^^^ja^S^iB^iSaM^ &qae&<aipiM^aa& map location for 

AI^ suggests ti^'^ 

characteristic of me A^Stadry. thereforei me nucleic 

useful in potential magnostic an^lterapeutic appKcatiohs arid a^arestSarefrtedi; TiwsemcTwIe 
serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
15 wheremW'ines^^'^a^m^^lfiS nulclelc a^orbrme^ be" assess^ as ; well as 

potential thera^in^ 
molecule drug Wg^^ 

antibody), nukeSc^Mi usefal in gene r uiierapy (gehVdenH^/gene ablation); and (v>a 
composition promo^ti^^ vfvo CvO biological defense weapon. 

20 'Th^ materials 1 ate 

hnmmiospecrficairyto^ 

memods: These a^ "*» 
prediction froifi ^hy^roplaoD^ch^as des^ybed mti^ "Anr^ section 
below. Id varibus emb^nnKaats, conteinpiated AMF3 epitopes are hydrophilic regions of the 
25 AMF3 polypeptide^ 

hydrophobfcity or hyaroph^citjr plots; 

AMF-4 (also»referr*dct©;as- -AcfcNo. .274W7^-: n ,i.-. ::T .-, r: .r 

Novel AMF4 is rplasmmogen-like protem; The AMF4 clone is alternatively referred 
to herein as Acc: Nd 27486474, The AMF4 nucleic acid of 439 nucleotides is shown hvTable 
30 4A. The AMF4 operi reading frame ("ORF^) begins ai positions 2-5. The AMF40RF 

tenninates at a TAA codon at nucleotides 93-95. As shown in Table 4A, putative untranslated 
regions^ to te stop codan ert\x^]w& and the stop codon is in bold letters. AMF4 does 
not begin at an ATG startsite; so it is most likely a C-ternrinal ooding fragment It is 
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; '^'4:V,-. ■■ 
contemplated that the AMF4 ORF extends in the 5' direction of the nucleic acid (SEQ ID 

NO:7) and the N-terrninal direction of the polypeptide (SEQ ID NO:8). 

Table 4A. AMF4 nucleic acid (SEQ ID NO:7) 

X CAC GGG AAT AAG CCT GOG CCC GTC CCT TTG ATT TCC AAC AAG ATC 
: TGC AAC CAC AGO GftC QTQ TAG GOT GGC ATC ATC TCC CCC TCC ATG 
CI^TGC GCQ; GGCcT^ 
. GAC AGC GGG. GGG CCC CTG-GTG TGT CAA GAG AGG AGO CTG TGG AAG 
TTA GTG GGA GCG ACC AGC TTT GGC ATC GGC TGC OCA GAG GTG AAC 
AAG CCT GGG GTG TAC ACC GTG *TCA CCT CCT TCC TGG ACT* GGA TOCT " 
ACG AGC AGA TGG AGA GAG ACC TAA AAA COT QAA GAG GAA GOG GAT 
njifl TAG c g ai iggpr^sag; : ttc groFifcra <^ GAff CCT .-. 

.I ^ ™ QAH. TCG CGT GTA GGA ACC, ITiT ftVTft fflft SCft fiftC ACC CTT , .. . ; , 
QQA GCT CT^ AST . TC C GGC ACC AGT AGC AGG CCC 

5 The encoci^ (SHSQ © I^r^irai^ 1 ^^ ^^affi3&ier^ 

acid code in Table 4B. 

-"***-'' v»«: L: .'v-'r-.Tr^-nKv: serins r-ro'^^vi broi^in may b« useful in ^ene theraov, -~d me 
Table 4A. AMF4 polypeptide (SEQ ID NO:8) 

HGNKPGPVPLISNKICMHRIJVyGGIISPSMIX!AGYI*TGGVDS 



GATSFGIGCAEVNKPGVYTVSPPSWTGSTSRWRKT ' 



'itiWrnaiy^ 

(GenBank Acc. Nof AB038159) (SEQ ID NO:69) shown in Table 4C. In all BLAST 
alignments heieii^ tlie "J^Vtf^ 

thai tte alig^^ sequence 
by cbance alcme/ witbfc tte d«Gtf^ r ^ was 's&ix±& r FW^ i&'hx&rii ia Table 4C, 

15 theprob^iIity1I& 

case ^ Serine |i6fe^ r ^^ft)^n, i£ittcfc& tte Qti^ABf^ by chance i& 

zOT/EvsdueOJOr 

Tai£ 4G. BLASTN of AMF4 against ABQ38159 (SEO ID NO:69) 

>AB038159 H- sapiens TMPRflS3a xnKNA for serine protease, aosqplete ode. 1/2001 
20 Length'- -2135. * 1 Strandi:* - Plus=/-PIu*^ r r.r: ; .-. :, - ;j .^ v --rr-. s :™ — - n;-~ •-.»• 

score » 809 bits (408), Bxpect « 0.0 

identities , 4 l8/4ao : vOS%),. Gape \-:;i/«0-. s -<«*.^. ■ . J:. J.' 
Query* 21 ecgteectttgattteeaa*aegatetgoaacaaoagggaa 80 
Sbjcfcr 950 ccgtccctttgatttc«^ ■ LO V" , 

Queryr 81. - . cecrctccatgct^tTO.<OTgctacc . .. - 

1 1 1 1 1 1 1 riTi i ri ill i mil n it ft rn i ri irri 1 1 1.1 iti-i 1 1 1 1 rTn 1-1 ri i " » 

30 Sbjct: 1010 ccccctceatgctetgcgcgggetacctgacsgggtggcgtggacagctgeeagggggaca 1069 
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Query: 141 gcggggggcccctggtgtgtcsagagaggaggctgtggaagttagtgggagcgaccagct 200 

iiiiiiiimiiiiii mimmiii milium iiiiimiiimn mi 

^ Sbjott 1070 goggggggaocctggtgtgtoaagagaggaggotgtggaagttagtgggagcgaooagot 1129 
Query: 201 ttggcatcggctgcgcagaggtgaacaagcctggggtgtaca-ccgtgtcacctccttcc 259 

1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 m m i rt ii 1 1 1 1 1 1 1 mimiiiiiii iii 

Sbjct: 1130 ttggcatcggctgcgcagaggtgaacaagcctggggtgtacacccgtgtcaectecttcc 2189 
10 -Query: 260 tggactggatccaegagcagatggagagagacetaaaaaectgaagaggaaggggataag 319 

I M 1 1 1 1 1 1 IJ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 H 1 1 1 II 1 1 1 1 1 1 1 1 1 III 

- ~ Sb j et *: 1190 tggactggatccacgagcagatggagagagacc t aaaaacc tgaagaggaaggggacaag 1249 



50 



Query : 3 2 0 t agccacctgagt t cc t gaggtgatgaagacagcccga ;t ec tocce tggact a ccgtgt a 379 

15 IMI II lllllll II Mil II 1 1 llllllill II III 1 1 1 llllillllltlMIII 1 1 1 - 

Sb j ct : - \12S& tagccacctgagttcctgagcrtgatgaagaca^dccgat cctcccctggactcccgtgta 13 09 
Query t- 390- -ggaabctgeaca^ 

- I I 1 1 1 1 1 1 1 1 1 1 U 1 1 1 1 1 11 1 1 1 1 1 M il I I LL! M.I 1 1 JIUI 1111 LI I I -1 .1. Ill 

20 sbj ct r 1310- ggaacct^ 

• , c , v^r,: 1 :-.'::;;' ; ouii^';,i:i, .-jatcriijOirtii-d aM.F^ epilogs :-re ;r.. avr.r.ii:-.: .-,^.:,h 
Additional BLASTN infonnalira^ 

41). _ _ ^ — v 

Table 4D BLASTN analysis results for AMF4 

25 Soore B 

Sequences producing significant alignments s (bit*) Valna 

AB03 9159 ABQ3 8159 Bomb sapiens- TMERSSXc^ JnRNJL f or eexirie : pr^ ; : . ; ^ : 809 ' 0 . 0 " rr: : : " - 
AB038158 AB038158 Homo sapiens TMPRSS3b mRNA for serine jocpt,.,*.*-, .809 0.0 
AB03 81 5 7 AB03815 7; : Homo* Sa^lensi TKPBB&3 a< mRNA - for serine prot-.. * 809 1 d ; 0 ' 
30 AF201380 AF201380 Homo sapiens serine protease TADG12 raRNA, . . . 753 0-0 

AP001746 AP00174R Homtt sapieiia; gehc«tiic BMA^ ^ ^303? 2e~79 ' 

AP001623 AP001623 Homo sapiens genomic DNA, chromosome 21, c... 301 2e-79 
AC01S5S5 AC01555S Somp atpieMfghrm^oiaa^l olene> RP1*1- 11 A? ^ 301 : 2eV79 : :r - 



35 A BLASTP search was performed against public protein databases. The results from 

this comparison are shown in Table 4E. 

--- v\V- f '"C^-.-"--3.v3ViC7:C^TA0ACA': ^-i-CCC ^crrTCVrAT': '2 w^-Vj-^'j ; score ^ ;\K .I'l;/^"? 

Saqpiejaeaa-eaccsauaJW- «*9^ (bits) Valtia i--*" ~ 

40 PUWJPia P068j6t7_aWrAOXO«ar <pig>^ plasmittoge^:<ed:',3/:4V31' ;T)*v. i - 102 Se-22 

PLMtf~BOVIN- PG«86^ bo» ; taurua , (boyineH / plaominogen' proauraor". . .:; r. .101 . 2 e-2 1 : 

HEPSUWpb^ C^4^ serine protease heps. . . 98 2e-20 

PLMN~HORSS"P80010'equai» caballuo (horse), plasminogen (ec 3.... 97 3e-20 

PIjMN_MACMU P12545 macaca mulatta (rhesus macaque) . plasminog. . . 96 5e-20 

45 HEPS_RAT Q05511 rattus norvegicus . (rat K . serine protease, hep. . , . - .96 5e-20 

HBPS_HUMAR POS982 hrinio" sapiens 1 (human) : serine protease hope . . . 9€ Se-20 

PLMN_HOMAH POO 7 47 homo sapiens (human) . plasminogen preeurso. . . 96 €o-20 

Q15146 Q15146 homo sapiens (human), plasminogen precursor* 1 — 96 6e-20 

O46507 046507 papio hamadryas (hamadryas baboon), plasminoge . . . 96 8e-20 



For example^aadwwn in Table 4F, the ^ AMF4 jroteih has 48 of 81 amino acid residues 
(59 %) identical to, and 60 of 81 residues (73 %) positive with, the 790 amino acid residue 
long plasminogda from pig (Acc. No. P06867) (SEQ Bb ^6:70). 
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Table 4F. BLAST? of AMF4 against P06867 (SEQ ID NO:70) 

PUOf PIQ P06867 sus tcro£a (pig), plasminogen (sc 3.4*21.7) • 10/1996 Length - 790 
Soor« - 103 bits ,<2S2> , Bxpaofc - 6o-22 

Identities - 48/81 (59%), Positives - 60/81 (73%)', ?. 1/81 (1%) 

5 ■ '**rft3b ^ . 

Ouerv: 4 KPGPVPI*ISNlCICNHRDfVYGGI ISPSMLCAGYliRGGVDS CQGDSGGPLVOQKRRIjWKLVQ 63 

I *l*l ll*ll_* M +M+ 1111*1 11*1111111111111 I* + I I 

• StFjet : 697 KBARI^VIHNXVCMRyBYI/KSKVSPNSLCAGHIJVCGIDS CQQPSQGPLVCFEKDgglLQG 756 
10 Query: 64 ARSFQIGCASVNKPOVY- RVS 83 

I-+MII Hil l I III \~ . . 

Sbjcts 757 VTSWGLOCAI»PHKPavyVRV8 777 ...-_ : -r-. 

15 — In add**"* 1 ^ y.?^ ft ^J.! t ;- T ^ e ,^rfr e .AMF4 protein has 47 of &2 amino acid residues 

(57 %) identical to, and 58 of 82 residues (70 %) positfv^g^^ ' 
long bovine plasminogen precursor (Acc. Nix P06868) (SEQ ID NO:71). 



i- I is 3 i- i 



^ Table4G. BLASTP of AMF4:against P06868 (SEQ ID NO:71) 

20 PLMBJJBOVIW P06868 bos tanras plasminogen prscnvstr r((^3>A^L>;s7X 11*/1$£3: fcs»gfeb-y» . 

'"8i2rl.::rv ; >--.;; \/. BA ! ,:-S ; ";Y ... \ - -i ; • 

Score,.- 101 bits (248) r ; Expect .«* 2a-21 /.\ f -vr , ;t, -y 

Identities - 47/32^ C57%) ,.;Pqpitives-. 58/82 (70%) ^^^s ;^.;i/82 Jl%) ' ^ ;I 

25 Ouerv : 4 KPOFVPI»ISNXI CNHRDVYGG IiaPSMIjCAGYIjRGGVDS CQODSGOPIiVCQBRRIiinCLVa ; 63 N ;G * N \ X : 

_ , : ; , ,1.. ,-hl.*|. !! U*l tw*-,t..* ! l*,J i U.I*.l I I IIIUIMHI H t* :l; I ~: 

."iS Sbjct i ,719 KBftHLPVI JWJiypaia iBYIiI XARVi^^ 778 v 

Query : 64 ARSFGIOOVKVBnCWJVY-KVSP' 8tf - 

3® l + MII Mil I I: I I t - ^CO; -4; n^, tK o.: .••>•,:/. : - : , ,., 

Sbjcts 779 VTSWOI*GC3«POTCPOVWRVfiP SaO- . , :V ; -v. \ ^ 

The presence of identifiable domains in AMF4, as well as all other AMFX proteins, can 
S be determined by searches: using softvrare,^ 

35 Pfaxn, ProDomain, and Prints, and then determining the Interpro number by crossing the 
^3. ' domain match (or numbers) using the Interpro website flrttorwww.ehLoo^V i^terproy 
J DOMAIN results can then be coUecfed firom ^Conserved Domain DataBase (CDD) with 
M Reverse Position Specffiti BLAST '&Mjja&'"TEis BLAST analysis software samples domains 
j| found in tfctf Smart and Pfiam collections. For this DOMAIN sequence arguments, fuUy-. v . 
40 conserved single residues are indicated by black shading "strong" semi-conserved residues are 
-7? indicated by grey. The "strong" group of conserved amino acid residues may be any one of the 
following groups of amino acids: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, 
FYW. AMF4 shows good homology with the consensus sequence of the trypsin-like serine 
protease domain (Smart|Tryp_SPc, E- 2e-21) and the trypsin domain (Pfam00089, E= 2e-14). 
45 The alignment with the trypsin-like serine protease domain (SEQ ID NO:72)(labeled 
> "Consensus") is shown in Table 4H. 
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TABLE 4BL DOMAIN ANALYSIS FOR AMF4 - ALIGNMENT WITH TRYPSIN- 
LIKE SERINE PROTEASE DOMAIN (SEQ ID NO:72) 

— | 1 — | 1 .... I 1 1 1 1 1 — I — I 

Consensu RZVOGaBaHiaSPTOUVSLCfXRGGG-KHFOGQSLlSPRVfVLTAAHCVYaSD SS 52 

iXF4 V- 1 

Consensu* TJtVRIA3SHDCt*SSGBBT QTVKVSKVIVHPNYKP- - STYTj^I^TiTiKT<KKFVTI>8DTVR X07, 

&2CF-4 : 1 

; ,i i . . . . iv: : . i J. My. . , . i . JiL . f : ; /.i ^ . 

CanBMfUl PXd»P3a OYNVPJUJTl'C'JVSawaRTSB - SQOSLPDTLQffVNMrCTMnJTTO-H 158 

) ; ; ■ . b > . I vxv .. h » ■ • I * I • - * U * - r 1 -* ■ I.-. -. ■> It - - r I : 

— AYSQ&A|ttninSB$lB — ES I( B fl ^^»ffl35Bltt JI * , ~""""S"B v ffll IV B*^ ao7 ~ 



AMF4 ; : : T-™*H 3 *T W HiH^^ 

The, tryp^-like serine protease domainjs present \n. a large fen^y of jprpteins, 
S including many that are synthesized as inactive precursor zymogens that are cleaved during 
limited proteolysis _ 

AMF4 has similarity to plasminogens. Plasmin dissolves the fibrin of blood clots and 
acts as a proteolytic factor in a variety of other processes including anbryonic development, 
tissue remodeling, tumor invasion, and inflammation; in ovulation it weakens this walls of the " 
10 graafian fb^^ 

several complement zymogens, such as cl and c5. it cleaves fibrin, fibronectin, 
thromtx>spondiivl s " n ^^ orviizc- ^ ;^ : v; 

L Plasminogen is the zymogen in uie circulating blood from wmch plasmin is; formed- : _ 
Pl flCThiwrt gen ifl fl flTnglB^rmig glyc op i o te m witn 790 amino acid residues. Activation to the. . 
1 5 active form; plasmin^ by;u*qk£u^(Gn^^ fcl^tence in Man ^OMIM") Acc. No, 

191840) mvolves cleavage^ 5fS0jua4 561, resxdting in the 

formation of the 2-chain plasmin molecule held together by 2 disulfide linkages. The heavier 
chain contains about 41 1 residues and the Lighter chain about 233. The main function of 
plasmin is the Question of fibrin in blood clots, Plasmin is a pro^lytic enzyme with a 
20 specificity similar to that of trypsin. Like trypsin, plasmin belongs to the family of serine 
proteinases, in which the active site catalytic triad, His^57, Asp-102, and Ser-1 95 
(chymotrypsin numbering), is situated in the light chain. 

Thft p rfl<CTTiiTih g^ ^trvation system is one pathway that has been consistently 
implicated in cancer. Its relevance to cancer extends from being responsible for many of the 
25 hemorrhagic episodes that occur in cancer patients to being fundamental to many, if not all of 
the molecular mechanisms that define tumor progression: Extravasati w and intarvasation of 
• . ^ : : - 23 



™« RAGE BUNK (uoto, 



WO 01/74897 



PCT/USO 1/10892 



solid malignant tumors is controlled by attachment of tumor cells to components of the 
basement membrane and the extracellular matrix, by local proteolysis and tumor cell 
migration. Strong clinical and experimental evidence has accumulated that the tumor- 
associated serine protease plasmin, its activator uPA (urokinase-type plasminogen activator), 
5 the receptor uPA-R (CD87), and the inhibitors PAI-1 and PAI-2 are linked to cancer invasion 
- and metastasis. In cancer, increase of uPA, uPA-R, and/or PAI-L is associated with tumor 
progression ami with shortened disease-tree and/or overall survival in patients afflicted wife . 
malignant solid tumors. uPA and/or its inhibitor PAL- 1 appear to be one of the strongest 
prognostic markers so fkr described. Strong pxpgnostic vdue to predict diseases rwtir^ 

10 overall survival has been documented for patients with cancer of the breast, ovary , cervix, . 
, endometriinn, stomach, colon, lung, bladder, kidney, brain, and soft-tissue. Due to the strong 
correlation between elevated uPA and/or PAI-1 values in primary cancer tissues and the tumor 
invasion/ metastasis capacity of cancer cells, proteolytic factors have been selected as targets 
for therapy. 

15 " A r^ert an^ogim^ 

Lewis lung <^rcinoma \vas isolated and designated the inhibitor angiostatin. See, O'Reilly 
: etal. 1994e*/J79i^ 
containing at least 3 of the kxingles of plasminogen. I^ombinarjlJragmeiits of angiostatin: 
show inhibitoiy activir^^ 

20 Angiostatin is produced by the proteolytie cleavage of plasminogen by a serine pro tease 

produced r^sever^hurj^nt^ lines. See,e.g., Qately e/ al. 1996 Capper 

.. Res* 56: 48 87-4890* A shift of bsJmcs of ttimor anpogene^is by gerje transfer of a cDNA 
coding for. mouse angiostatin into murine T241 fibrosarcoma cells suppresses primary and 
metastatic tumor growth in vivo. See, e.g.. Cap et al. 1998 J, Clin, Invest. 101 : 1055-1063. 

25 Implementation of stable clones expressing mouse angiostatin in C57B J 6/J mice inhibited 

primary tumor growth by an average of 77%. After removal of primary tumors, the pulmonary 
micrometastases in approximately 70% of mice remained in a microscopic dormant and 
avascular state for 2 to 5 months. The tumor cells in the dormant micrometastases exhibited a 
high rate of a^ptosis balanc«i by a high proliferation ratb. These! studies showed the 

30 Htmtrrishe d growth of limg metas tase s after removal of the primary tumor, suggesting that 
metastases are self-inhibitory try halting angiogenesis. The data may also provide a novel 
approach for cancer therapy by anti-angiogemc gene therapy with a specific angiogenesis 
inhibitor; The .^mgiostaiirj^indfuced long-term dormancy of lung metastases was equivalent to 14 
to 15 human years (wheq 1 mouse day is equivalent to approximately 35 human days). 

. . ... . : , ■■ •• • 24 • - . ■■ - 
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Overexpression of AMF4 in concert with a plasminogen activator such as uPA 
(urokinase) could potentially stimulate tumor cell invasion and migration. Alternatively, 
AMF4 could serve as a substrate for an unidentified serine protease fliHn to the protease that 
cleaves plasminogen to angiostatin. In this manner, tumor cells might limit the production of 
5 this important anti-angiogenic factor. 

Therapeutic t a rgeting of AMF4 is anticipated to limit or bloctthe extent of tumor cell 
invasion/motility and metastasis. Potentially therapeutic targeting of AMF4 might shift the 
balance in favor of the production of angiostatin or a similar molecule with anti-angiogenic . 

activity.,.. . ^, .„ ^ 

10 Expression information for AMFX RNA was derived using tissue sources including, 

but not limited to, proprietary database sources, public EST sources, literature sources, and/or 

RACE sources, as described in the Examples. 

. The nuclbic acids and proteins of A2vlF4 are useful fin pbtcntiai &erapeutic applications 

implicated in various AMF-rclaled patho For exainple, a cDNA n " ' 

15 encoding the trypsin-like serine protease prb^^a^ b^ list^ - - 

trypsin-like serine protease protein may be useful who* adniiinsttt^ ^ 

thereof. The novel imcieic^ 

useful in diagnostic applications, wherein thepresenee or ainounlof the nuclei&acid or the: ■ 
protein are to be assessed: ; These materials are further useful in the generation of antibodies 

20 that bind immunospecitically to the novel substances of the invention for use in therapeutic or 
diagnostic me&66& r r: - a ::: s i ^^i^io) 

The AMFX nucleic acids and proteins a*e useful in notcqti^di^gpQ^ 
applications implicated in various diseases and disordep described b«low an^ r 
pathologies. For example, the compositions of the presort J^mtion win have efficacy for 

25 treatment of patients suffering from: cancer, blood clotting disorders and other diseases, 

disorders and conditions of the like: By way of nonlimiting example, the compositions of the 
present invention will have efficacy for treatment of patients suffering from cancer, blood 
clotting disorders. Additional AMF-related diseases and disorders are mentioned throughout 
the Specification; : — — 

30 Further , uie protein similarity 

AMF4 suggests that AMF4 may have important structural and/or physiological functions 
characteristic of the trypsin-like serine protease fktnily. Therefore, the nucleic acids and 
proteins of the invention are useful in potential diagnostic and therapeutic applications and as a 
research tooL These include serving as a specific or selective nucleic and or protein diagnostic 
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and/or prognostic marker, wherein the presence or amount of the nucleic add or the protein are 
to be assessed, as well as potential therapeutic applications such as the following: (i) a protein 
therapeutic, (ii) a small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, 
drug targeting/cytotoxic antibody), (iv) a nucleic acid useiul in gene therapy (gene 

5 delivery/gene ablation), and (v) a composition promoting tissue regeneration in vitro and in 
vivo (vi) biological defense weapon. 

These materials «re fur&^ 
immunospecificaHy V 
• methods/These antibodies^^^ 

10 predictfbnftdmhychroph^^ 

below. In various embocfime^ AMF4 epitopes are hydiophiliare^onsrpf the 

AMF4 polypeptide aspredfct^b^^ 

hydrophobicity crhydrophiHcj^plote; , : ^l-7^j-r--cc- — r - : - - - :~ 

AMF-5 (aho refarreito as Ace No^ 2969138?) 

15 Novel AMF5 is an organic anion Unsporting peptide-like protein ("OTAP") protein. 

The AMF5 clone is alternatively tefsnedta^am AMF5 nucleic 

acid of 2646 nucleotides is shown in-Table 5A» -llei AMFS <>p« re«ii^_feme ("ORF") 
begins at nucleotides 3*5. AMH5 app^tobe an intern^ fragrant, so it is contemplated that 
the OKF could extend beyond fhe.N- and C- teroii±dqpict^ in Tables SA and 5B A? shown 

20 in Table 5A, the fii^ coding;^ ..... ...... . ;c r; ( . 

Table 5 A» AMF5 nucleotide seqneMe(SEQ ID NOdK^ 



AGTCAGTGGGftACCftG^CT 
25 owBDtt!^^ 

• a acTCTTTOTTC'^ 

The encoded AMF5 protein (SEQ ID NO: 10) is a 136 amino acid protein shown in 
30 Table 5B. 

^ Table SB. AMFS amino acid sequence (SEQ ID NO:10) 



GIKKHTVFYNCS' 



CVENTTGMNRNYSAHI/ffiCPRD^^ 



35 In an analysis of public nucleic acid sequence databases, it was found, for example, that 

theAMFS nucleic acid sequence has?363 of 374 bases (97%) identical to a Homo sapiens 
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NOVEL PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

FIELD OF THE INVENTION 

The invention generally relates to novel AMF1 , AMF2, AMF3, AMF4, AMF5, AMF6, 
AMF7, AMF8, AMF9 and AMF10 nucleic acids and polypeptides encoded therefrom. More 
5 specifically, the invention relates to nucleic acids encoding novel* polypeptides, as well as 

vectors, host cells, antibodies, and recombinant methods for producing these nucleic acids and 
polypeptides. 

BACKGROUND 

A need exists for diagnosis, prognosis, and prophylactic or therapeutic treatments of 
10 disorders and diseases whose underlying mechanism relates to cell-cell interactions via 

molecules expressed on the cell surface. Such diseases and disorders include those related to 
the modulation of cell movement, cell signal processing, cell adhesion or cell migration 
pathways, including, but not limited to, tissue remodeling, proliferative diseases, cancer, tumor 
invasion and metastasis, developmental processes, connective tissue regulation, and effects of 
15 other extracellular microenvirons. This inventibffpiWvfiles iriethods and compositions to fill 
this need. ;*;: , . 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. The disclosed AMF1, AMF2, AMF3, AMF4, AMF5, AMF6, 
20 AMF7, AMF8, AMF9 and AMF10 nucleic acids and polypeptides encoded therefrom, as well 
as derivatives, homologs, analogs and fragments thereof, will hereinafter be collectively . 
designated as "AMFX" nucleic acid or polypeptide sequences. 

In one aspect, the invention provides an isolated AMFX nucleic acid molecule 
encoding a AMFX polypeptide that includes a nucleic acid sequence that has identity to the 
25 nucleic acids disclosed in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 19. In some 

embodiments, the AMFX nucleic acid molecule will hybridize under stringent conditions to a 
nucleic acid sequence complementary to a nucleic acid molecule that includes a protein-coding 
sequence of a AMFX nucleic acid sequence. The invention also includes an isolated nucleic 

1 
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acid that encodes a AMFX polypeptide, or a fragment, homolog, analog or derivative thereof. 
For example, the nucleic acid can encode a polypeptide at least 80% identical to a polypeptide 
comprising the amino acid sequences of SEQ ID NOS:2, 4, 6, 8; 10, 12, 14, 16, 18 and 20. The 
nucleic acid can be, for example, a genomic DNA fragment or a cDNA molecule that includes 
the nucleic acid sequence of any of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17 and 19. 

Also incited in the invention is an oligqnucleotidej e.g. 9 ati oligonucleotide which 
includes at Je^^ QQntiguous nucleotides of a AMFX nucleic acid (e.g. , SEQ ID NOS: 1, 3, 5, 
7, 9, j 1, 13, 15^17 and 19) or a implement ^qf said oligonucleotide: ; _ . _ . 

. , Also mclude^ AMFX polypeptides (SEQ ID 

*vTO: 2* 4, 6, 8, 10* 12;i4»:l&^^ the AMFX polypeptides 

include an amino apid sequence th^ is su^ acid sequence of a 

human AMfXpolj^ptide;^ , 

_ _ The invention also features antibodies that immunoselectively-binds to AMFX 
polypeptides, or fra^ent^to In one embodiment of 

the invention, the anti-AMFX antibody is polyclonal. In another embodiment of the invention, 
the anti-AMFX antibody is monoclonal. In oilier embodmrats'of ih^^o^the anti- 
AMFX antibody is therapeutic. ~ 

In another aspect; the; inventio^ra 
therapeutically^ or prophylactic^ly-eflFective^amouiits of a therapeutic and a pharmaceutically- 
acceptable carrier. -The therapeutic canibe? &gJ 9 lkl&MBX nucleic acid, a AMFX polypeptide, 
or an antibody specifiC"foi ^^ AA^X-polypeptrde.: In a further aspe^ the invention includes, in 
one or more containers, a thera^uticaUy- dr j^o amount of this 

pharmaceutical: composition .?v ::r ' ■ ] 4 ^mmo ^.lcs r ..;:~:izsl; : 

in. a. further aspect^ the invention inelu<tes;^^ 
culturing aicell that includes a ;AMFX nucleic acid, under conditions allowing for expression of 
the AMFX polypeptide encoded by the DNA. If desired, the AMFX polypeptide can then be 
recovered. . ,\_ . 

In anoth^aspect, the invention includes a method of detecting the presence of a 
AMFX polypeptide in a sample. Lithe method, a sample is contacted with a compound that 
selectively binds, to the polypeptide under conditions allowing for formation of a complex 
between the polypeptide and the compound. The complex is detected, if present, thereby 
identifying the AMFX polypeptide within the sample. 

The invention also includes methods to identify specific cell or tissue types based on 
their expression of a AMFX. 
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Also included in the invention is a method of detecting the presence of a AMFX 
nucleic acid molecule in a sample by contacting the sample with a AMFX nucleic acid probe 
or primer, and detecting whether the nucleic acid probe or primer bound to a AMFX nucleic 
acid molecule in the sample. 
5 In a further aspect, the invention provides a method for modulating the activity of a 

AMFX polypeptide by contacting a cell sample that includes the AMFX polypeptide with a 
compound that binds to the AMFX polypeptide in an amount sufficient to modulate the activity 
of said polypeptide. The compound can be, e.g. , a small molecule, such as a nucleic acid, 
peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon containing) 

10 or inorganic molecule, as further described herein. 

Also within the scope of the invention is the use of a Therapeutic in the manufacture of 
a medicament for treating or preventing disorders or syndromes including, e.g., disorders 
related to cell signal processing, cell adhesion or migration pathway modulation, including, but 
not limited to, chemoresistance, radiotherapy resistance, survival in trophic factor limited 

15 secondary tissue site microenvironments, connective tissue disorders, tissue remodeling, 

oncogenesis, cancer of the breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, 
bladder, kidney, brain, and soft-tissue, cellular transformation, developmental tissue 
remodeling, inflammation, blood clot formation and resorption, hematopoiesis, angiogenesis, 
multidrug resistance related to organicjouon transporters, malignant disease progression, 

20 ^J^B^^^P^^^g^ll^ft^of ^^^ w §}vff illular res P onses to external stimuli. In 

contemplated embodiments, successful targetini^pf AMFX polypeptides using an anti-AMFX 
monoclonal antibody is anticipated to have an. inhibitory effect on tumor growth, and other 
AMFX-related diseases and disorders. The Therapeutic can be, e.g., a AMFX nucleic acid, a 
AMFX polypeptide, or a AMFX-specific antibody, or biologically-active derivatives or 

25 fragments thereof. 

The invention further includes a method for screening for a modulator of disorders or 
syndromes including, e.g., disorders related to cell signal processing, cell adhesion or 
migration pathway modulation, including, but not limited to, chemoresistance, radiotherapy 
resistance, survival in trophic factor limited secondary tissue site microenvironments, 

30 connective tissue disorders, tissue remodeling, oncogenesis, cancer of the breast, ovary, cervix, 
prostate, endometrium, stomach, colon, lung, bladder, kidney, brain, and soft-tissue, cellular 
transformation, developmental tissue remodeling, inflammation, blood clot formation and 
resorption, hematopoiesis, angiogenesis, multidrug resistance related to organic anion 
transporters, malignant disease progression, autocrine and paracrine regulation of cell growth, 

3 
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cellular responses to external stimuli. The method includes contacting a test compound with a 
AMFX polypeptide and determining if the test compound binds to said AMFX polypeptide. 
Binding of the test compound to the AMFX polypeptide indicates the test compound is a 
modulator of activity, or of latency or predisposition to the aforementioned disorders or 
5 syndromes. In- one embodiment; the test compound is a anti-AMFX antibody. 

Alsb vidthin the scope of the invention is a method for screening for a modulator of 
activity, or of latency or predisposition to an disorders or syndromes including, e.g. , disorders 
related ta cell signal processing, cell adhesion or migration pathway Modulation* including, but 
not limited to, chemoresistance, radiotherapy resistances survival it* trophic factor limited 

\0 second^ tissue site nricrbOTV^ 

oncogenesis, cancer of the to stomach, colon, lung, 

bladder, kidney,- brainy and soft-tissue, cellular transformation, developmental tissue ^ 
remodeling; inflammation, blood clot formation and fe^orptionf hematopeiesis- arigiogenesis, 
multidrug resistance related to organic anion ti^spbrters* malignant disease progression, 

1 5 autocrine and paracrine regulation of cell growth', cellular responses to eternal stimuli, by 
administering a test compound to a test aniih&l^at increased risk for the^aforementioned 
disorders orsyndrorras; ; -The test animal- expresses a by a 

AMFX nucleic acid. Expression or activity of AMFX polypeptide is then measured in the test 
animal, as is expression or activity of the protein in a control animal which recombinantly- 

20 expresses AMFX polypeptide and is not at increased risk forthe disorder or syndrome.; Next, 
the expression of AMFX polypeptide in bbththe test 1 animal' and thecontrol animal is 
compared: A change in the activity of AMFX polypeptide in the test animal relative to the 
control animal indicates the test compound is a modulator of latency of the disorder or 
syndrome; 

25 In yet another aspect, the invention includes a method for determining the presence of 

or predisposition to a disease associated with altered levels of a AMFX polypeptide, a AMFX 
nucleic acid, or both, in a subject (eg:, a human subject); The method includes measuring the 
amount of the AMFX polypeptide in a test sample from the subject and comparing the amount 
of the polypeptide in the test sample to the amount of the AMFX polypeptide present in a 

30 control sample. An alteration in the level of the AMFX polypeptide in the test sample as 

compared to the control sample indicates the presence of or predisposition to a disease in the 
subject; Preferably, the predisposition includes, e.g., disorders related to cell signal processing, 
cell adhesion or migration pathway modulation, including, but not limited to, chemoresistance, 
radiotherapy resistance, survival in trophic factor limited secondary tissue site 

4 
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microenvironments, connective tissue disorders, tissue remodeling, oncogenesis, cancer of the 
breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, bladder, kidney, brain, and 
soft-tissue, cellular transformation, developmental tissue remodeling, inflammation, blood clot 
formation and resorption, hematopoiesis, angiogenesis, multidrug resistance related to organic 
anion transporters, malignant disease progression, autocrine and paracrine regulation of cell 
growth, cellular responses to external stimuli. Also, the expression levels of the new 
polypeptides of the invention can be used in a method to screen for various cancers as well as 
to determine, the stage of cancers. ; _ 

In a further aspect, the invention includes a method of treating or preventing a 
pathological condition associated with a disorder in a mammal by administering to the subject 
a AMFX polypeptide, a AMFX nucleic acid, or a AMFX-specific antibody to a subject (e.g., a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, including, e.g., disorders related to cell signal processing, 
cell adhesion or migration pathway modulation, for example, but not limited to, 
chemoresistance, radiotherapy resistance, survival in trophic factor limited secondary tissue 
site microenvironments, connective tissue disorders, tissue remodeling, oncogenesis, cancer of 
the breast, ovary, cervix, prostate, endometrium, stomach; colon, lung, bladder, kidney, brain, 
and soft-tissue, cellular transformation, developmental tissue remodeling, inflammation, blood 
clot formation and resorption^ hematopoiesis, angiogenesis, multidrug resistance related to 

- AC CG 'TGTC 

organic anion tramporterSjcmabgnant disced progression, autocrine and paracrine regulation 
of cell growth, cellular re^j^s to e>rte^^timuli. 

In yet another aspd&£ the invention can be used in a method to identity the cellular 
receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
affinity purification, co-precipitation with antibodies or other specific-interacting molecules. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or equivalent to those described herein can 
be used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 
specification, including definitions, will control. In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 
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Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION 

The invention is, based, in part, upon the discovery of novel nucleic acid sequences that 
5 encode novel polypeptides. The novel nucleic acids and their encoded polypeptides are 
referred to individually as AMF1, AMF2, AMF3, AMF4, AMF5, AMF6, AMF7, AMF8, 
AMF9 and AMF 10. "Thfnucfeic acids; and thett ^ncb^edj^i^ep^des, 1 khTcollectively 
designated herein as "AMF^J ' "* v "~" 

The novel AMEX nucleic acids of the invention include the nucleic acids whose 
10 sequences are pravddedinT^leMA^ or a 

fragment, derivative,.ahak>giMihQmolog th£^fi:Tlte:i^^ invention 
include the protein fragments whosesequences are provided in Tables IB, 2B, 3B, 4B, 5B, 6B, 
7B, 8A, 9A aikLl OA inclusive^ and proteins are described 

below. Within the scope: of this invention is a method of using these nucleic acids and peptides 
15 in the treatment or prevention of :-a disorder related to cell signal processing, cell adhesion or 
migration pathway modulation. - h:k- ■-•d ^q:--.: UouieU^y ^ . i r .u : ^„ 

AMF-1 (also referred to as Ace No7 1420^1 0.0.21^ 

Novel AMFI is a fibnlfin-lie protein. "The AMFi clone is alternatively referred to 
herein as Ace; Nd;~ 14209^ 

20 nucleotides is shown in Table IA£ The AMFI open reading frame^CORFO begins at 

nucleotides 208-21Q: The AMFt Ol^ tenninates at a TGA codon at nucleotides 1699-1701. 
In one embodmenVthe-AMFT^ WHEREIN it is 

contemplated tl^at the AMFI ORF extends beyond the N-terminus shown in Table 1A, /.<?., the 
sequence demarcated by the solid underline is intron sequence that is later spliced out when the 

25 mature full length mRNA is formed. In an alternative embodiment, the AMFI ORF begins at 
the in-frame ATG start codon at position 472-474 of SEQ ID NO: 1 . In this alternative 
embodiment, the 5* UT sequence (demarcated by the solid and dashed underline) would extend 
to this ATG. As shown in Tabte 1 A, putative '5* intron re^ori (or ahematively, die 5* 
untranslated regions) and the putative untranslated region 3 * to thestbp codon are underlined, 

30 and the putative start and stop codons are in bold letters. _ 

Table 1 A- AMFI nucleotide sequence (SEQ ID NO:l). 

CGGATGACTCCCQAQAAGGTQAQCCCCTCa^ 
ACTGCCTGGGTCTCTrGAGTTTCAGGCTCCCCC^ 

6 
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CTTTTGT CCCCTTGGGGCCTTGAGTGTGGCraGGGC^ 



10 



15 



20 



TTTGAGJ^CJTGG 

GCCACAATACCGAGGGCTCCTACCTGTGCACCTGTCCAGCCGGCTACACCCTGCGGGAGGACGGGGCCATGTGTCG 

AGATGTGGACGAGTGTGCAGATGGTCAGCAGGACTGCCACGCCCGGGG(^TGGAGTGC^GAACCTCATCGGTACC 

TTCGCGTGCGTCTGTCCCCCAGGCATGOSGCCCCTGCCTGGCTCTGGGGAGGGCrGCACAGATGACAATGAATGCC 

ACGCTCAGCCTGACCTCTGTGTCAACGGCCGCTGTGTCAACACCGCGGGCAGCTTCCGGTGCGACTGTGATGAGGG 

ATTCCAGCCCAGCCCCACCCTTACCGAGTGCCACGACATCCGGCAGGGGCCCTGCTTTGCCGAGGTGCTGCAGACC 

ATGTGCCGGTCTCTGTCCAGCAGCAGTGAGGCTGTCACCAGGGCCGAGTGCTGCTGTGGGGGTGGCCGGGGCTGGG 

GGCCCCGCTGCGAGCTCTGTCCCCTGCCCGGCACCTCTGCCTACAGGAAGCTGTGCCCCCATGGCTCAGGCTACAC 

TGCTGAGGGCCGAGATGTAGATGAATGCCGTATGCTTGCTCACCTGTGTGCTCATGGGGAGTGCATCAACAGCCTT 

GGCTCCTTCCGCTGCCACTGTCAGGCCGGGTACACACCGGATGCTACTGOTAC 

GCAGCCAGGTCCCCAAGCCATGTACCTTCCTCT^ 

CTACCTGCTGGAGGAGGATGGCAGGACCTGCAAAGACCTGGA03AATGCACCTCCCGGCAGCACAACTGTCAGTTC 
CTCTGTGTGAACACTGTGGGCGCCTTCACCTGCCGCTGTCCACCCGGCTTCACCCAGCACCACCAGGCCTGCTTCG 
ACAATGATGAGTGCTCAGCCCAGCCTGGCCCATGTGGTGC 

ctgtgaatgccaccju^ggctt^ 

caccgctgccagcatggctgtcagaaccagctagggggctaccgctgcagctgcccccagggtttcacccagcact 
cccagtgggcccagtgtgtgggtgagtg aaaagggct^^ 

AGGCGACTAACAGACGCC ACC CT GGRAGATGATGTGACAAGCACAAl^ATCTAAAGATTG AACAGGCC AGC C C AGA 
AGATGAGAATGAGTGTGCCCTGTCGCCC 



25 The 497 aa AMF1 protein (SEQ ID NO:2), is shown in Table IB. In an alternative 

embodiment, the AMF1 ORF begins at the first in-frame ATG encoding a methionine at 
position 89 in SEQ ID NO:2, shown bolded and underlined in Table IB. 

Table IB. AMF1 amino acid sequence (SEQ ID NO:2). 

QKP CLQRLPS ADVNECAENPGVCTNGVCVNTDGS FRCECPFGYSIiDFTG INcfvDTDECSVGHPCGQGTCTNVIGGF 
30 ECACADGFEPGI^TCEDIDECSIJXrPLLCAFRC 
CKOTiIGTFACTCPPGMRPLPGSGEGCTDDNE 

FAEVLQTMCRSLSSSSEAVTRAECCCGGGRGWGPRGELCPLPGTSAYRKLCPH 

GEC INSLGS FRCHCQAGYTPDATATTCLDMDECSQVPKPCTFLCKNTKGS FLCSCPRGYLLEEDGRTCKDLiDECTS 
RQHNCQFLCVNTVGAFTCRCPPGFTQHHQACro^ 
3 5 VNE CDGPHRCQHGCQNQLGGYRCS CPQGFTQHS QWAQCVGE 

In an analysis of public nucleic acid sequence databases, it was found, for example, that 
the AMF1 nucleic acid sequence has a 238 base fragment with 194 of 238 bases (81%) and a 
197 base fragment with 156 of 197 bases (79%) identical to Ma? musculus fibrillin 2 (fbn2) 

40 gene, complete cds (GehBank Acc. No. L39790) (SEQ ID NO:61) shown in Table 1C. In all 
BLAST alignments herein, the "E-value" or "Expect" value is a numeric indication of the 
probability that the aligned sequences could have achieved their similarity to the BLAST query 
sequence by chance alone, within the database that was searched. For example, as shown in 
Table 1C, the probability that the subject ("Sbjct") retrieved from the AMF1 BLAST analysis, 

45 in this case the Mus musculus fibrillin 2 (fbn2) gene, complete cds, matched the Query AMF1 
sequence purely by chance is 1 in 9x1 0 26 (i.e., a probability of 9x1 0" 26 ) for the first fragment 
and 1 in 7x1 0 8 for the second fragment. 
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Table 1C: BLASTN of AMFl against Mus fbn 2 (SEQ ID NOs:61 and 62) 

>MUSFBN2 1*39790 Mus mus cuius fibrillin 2 (fbn2) gene, complete cds. 8/1995 
Length =* 9859, Strand = Plus / Plus 
Score = 125 bits (63), Expect = 9e-26 
Identities = 194/238 (81%) 

Sbjct: nucleotides 6542-6779 (SEQ ID NO:61) 
Query: 293 tcaacaccgatggatccttccgctgtgagtgtccctttggctacagcctggacttcactg 3 52 

i n i in ii n ni i inn 1 1 ii hi 1 1 il i linn i min i him 

Sbjct: 6542 tcaacactgatggatctttccgatgtgagtgtccaatgggctacaacctggattacactg 6601 
Query: 353 gcaCcaactgtgtggacacagacgagtgctctgtcggccacccctgtgggcaagggacat 412 

I II IMI-miHI-Mrillllfll - IIHI 1 1 K I III II I lllllll 

Sbjct: 6602 gagtccggtgtgtggacactgacgagtgctccatcggcaacccntgcgggaacgggacat 6661 
Ovfery : 413 ' gcaccaal:cft cattg^ t ca 4 72 

.. .. ". lllllll. II lllll' 1 1 LI 1 1.1.1 1 1 . I.IJI .Ml Jill lllllll II III 

Sbjct: 6662 gcaccaacgtgatcgggt§cttcgaatg 6721 
Query: 473 tgatgacctgcgaggacatcgacgaatgctccctgaacccgctgctct^rtgccttccg 530 

: . . mil i :miii 1 1 1 il i ii 1 1.. n in mi n in 1 1 ri.j i m nm 

Sbjct:: 6722_ ^gatgaactgcgaagacatcaacg^ 6779 

Strand "=r £luia F Pltis " J " y - " r ~~ r - r - 

Score = 65.9 bits C33 V, ' aqpect" 7e-08" .*'" " : -- J =-;- 

Identities = 156/197 (79%) 

Sbjct : .nucleotides 7477-7673 . (SEQ ID NO: 62) 
Query: 1231 aagccatgtaccttcctctgcaaaaacacgaagggcagtttcctgtgcagctgtccccga 1290 

• - ! riniprr-Hi ir u ni : n -n m nrmr-i m n in i i it ti n: •■ ■• 

Sbjct: 7477 aagccatgcaacttcatctgcaagaacaccaagggcagttaccagtgctcctgcccacgg 7536 
Query: 1291 ggctacctgctggaggaggatggcaggacctgcaaagacctggacgaatgcacctcccgg 1350 

• - i i 'i ii' i- rth i i in 1 1 "ii ; i t ii nmmm .i m n ttv . n 

Sbjct: 7537 gggtacgtcctgcaggaggacggaaagacgtgcaaagacctcgacgaatgtcaaaccaaa 7596 
Query: 1351 cagcacaactgtcagttcctctgtgtcaacactgtgggcgccttcacctgccgctgtcca 1410 

n n mink i i mi i i n hi i n i in in i- 1: nmi i i i n 1 1 ... 

Sbjct: 7597 cagcacaactgccagttcctctgtgtcaacaccctggggggattcacctgtaaatgtccg 7656 
Query: 1411 cccggcttcacccagca 1427 

Sb j ct : 7657 cccggtttcacccagca 7673 (SEQ ID NO:62) 

In addition, the AMF 1 nucleic acid sequence has strong homology to other nucleic 
acids as shown in the BlastN results in Table ID, _ 

v.-re-nTo^ ce: - Table ID. BLASTN alignment data of AMFl 

: , ^ : ' , Scora. , . E 

Sequences producing significant alignments s (bits) Value 

MUSFBN2 L39790. Mus musculus fibrillin 2 (fbn2); gene , complet . . . . 125 9e-26 

MMU20217 U20217 Mua : musculus fibrillin- 2 ' mHNA f ~ partial" cds ... . 115 9e-23 

HUMFIBRLLN: L13923 Homo sapiens fibrillin .mRNA, complete -cds 72 le-09 

HSFIBRMR~X63556 H. sapiens mRNA for fibrillin. 2/1997 72 le-09 

AF1 8 7554 AF1 8 7554.. Homo, sapiens sperm antigen- 36 mRNA,. . comple . . . 72... Ier09 

AF135060 AF135060 Rat t us norvegicus f ibrillin- 2 mRNA, comple. . . 66 7e-08 

AF073800 AF073800. Sus. scrof a fibrillin-! precursor -, ( FBN1 ) mR. . . 58. 2e-0S 

AF13S05*- AF135059 Rattus norvegicus /fibrillin- 1 mRNA, comple... 56 7e-05 

A BLASTP search was performed against public protein databases. As shown in Table 
IE, the AMFl protein has 137 of 349 amino acid residues (39%) identical to, and 200 of 349 
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residues (57%) positive with, the 492 amino acid residue long Homo sapiens transmembrane 
protease, serine 2 (ec 3.4.21.-.) (SEQ ID NO:63). 

Tabic IE. BLASTP of AMF1 against TMS 2 (SEQ ID NO:63) 

TMS2_HOMAM homo sapiens transmembrane protease, serine 2 (ec 3.4 21 -) 7/1998 
Length = 492 

- Score = 266.0, bits (673.0), Expect = le-70 

Identities = 137/349 (39%), Positives = 200/349, (57%) 

CVRFDWDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRDFANS 60 
HI + I++M. I l + l |+ +I+++ + + + | + | 

CVRLYGPNFILQMYSSQRKSWHPVCQDDWNE^JYGRAACRDMGYKNNFYSSQGIVT>D- 206 

FSILRYNST IQESLHRSE-CPSQRYISLQCSHCGLR- - - AMTGR I VGG ALAS DS K 111 

l ++ I + 1+ 1+ I l + - + 11 + 1 11+ + - Mm | 

TSFMKLNTSAGNVDIYKKLYHSDACSSKAVVSLRCLACGVNLNSSRQSRIVGG2SALPGA 266 
WPWQVSLHFGTTHICC^TLIDAQWVLTAMC^FVTREKVLEG- - -WKVYAGTSNLHQLPE 168 

II 1 1 1 1 1 1 --1 + 111*+] + I++UIH Ml- I . +|| : . + 

WPWQVS1JWQNVHVCGGSIITPEWIVTAAHCV E KPLNNPWHWTAFAG I LRQS FMFY 322 

AAS- - IAEI I INSNYTDEEDDYDIALMRLS KPLTLSAHIHPACLPMHGQTFSLNETCW IT 226 

-I. + ++I + II .+ + IIIIJ + I..LI.II + + | III . | . + UK 

GAGYQVQKVISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWIS 382 
GFGKTRETDDKTSPFLREVQVNLIDFKKCNDYLVYDSYLTPRP^CA 286 

iti i i. 1 1 j i +i n+ ++M 1.11+ +ii i+ui i+i i nun 

GWGATEEKG-KTSEVLMAAK\TLLIETQRCNSRYVYDNLIT^ 441 
GGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTE^ 33 S " 

Mill II l + l I IIII + II.+ +IIII |. in +|++ 

GGPLVTSNI^IWWLIGI^SWGSGCAKAY^RPGvYGNVMVFTDWIYRQMKA 490 (SEQ ID 

AMF1 also has high homology to a number of other amino acid sequences as shown in 
the BLASTP alignment data in Table IF. 
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Query : 


1 




Sbjct : 


148 




Query: 


61 


15 


Sbjct: 


207 




Query: 


112 


20 


Sbjct : 


26~7 




Query: 


169 




Sbjct: 


323 


25 


Query: 


227 




Sbjct: 


383 


30 


Query: 


287 




Sb j ct : 


442 




NO: 63) 





Matching Entry Begin 

(in SwissProt + End 
SpTrEMBL) 

TMS2_HUMAN [1-335] 



HEPS_MOUSE 
HEPS_RAT 
KAL HUMAN 



095518 



097506 



Table IF. BLASTP analysis results for AMF1 
Description 



HEPS_HUMAN [11-335] 



[9-335] 
[9-340] 
[90-335] 



KAL_MOUSE [97-335] 



KAL RAT 



TRANSMEMBRANE PROTEASE, SERINE 2 (EC 
3.4.21.-). 

SERINE PROTEASE HEPSIN (EC 3.4.21.-) 
(TRANSMEMBRANE PROTEASE, SERINE 1). 
SERINE PROTEASE HEPSIN (EC 3.4.21.-). 
SERINE PROTEASE HEPSIN (EC 3.4.21.-). 
PLASMA KALLIKREIN PRECURSOR (EC 3.421.34) 
(PLASMA PREKALLIKREINXKININOGENIN) 
(FLETCHER FACTOR). 

PLASMA KALLIKREIN PRECURSOR (EC 3.421.34) 
(PLASMA PREKALLIKREINXKININOGENIN) 
(FLETCHER FACTOR). 
[87-335] PLASMA KALLIKREIN PRECURSOR (EC 3.4.21 .34) 
(PLASMA PREKALLIKREIN)(KININOGENIN) 
(FLETCHER FACTOR). 
[92-329] DJ 1 1 70K4.2 (NOVEL TRYPSIN FAMILY PROTEIN 
WITH CLASS A LDL RECEPTORDOMAINS) 
(FRAGMENT). 
[90-336] ALLIKREFN. 



Score 



E 

Value 



266.0 le-70 

232.0 2er60 

230.0 le-59 

224.0 8e-58 

219.0 2e-56 

215.0 3e-55 

213.0 2e-54 

213.0 2e-54 

204.0 6e-52 
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The presence of identifiable domains in AMF1, as well as all other AMFX proteins, can 
be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interprd number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interproV 
DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 
Reverse Position Specific BLAST analyses; This BEAST analysis software samples domains 
found in the Smart and Pfam collections. " ' . 

Expression information for AMFX RNA wa^&eriyed using tissue sources including, 
but not limited to, proprietary ^database sources^publ^ and/or 
RACE sources, as described in the Exampfe^AMEl: j5^^3^res^.iri^^e^.th^ollowing 
tissues: colon, gastric and ovmzar^ance^erive^ in fetal 

kidney and limg indi 

The nupleic acids and pra]^ 
implicated in various AMF-related pathologies and/or disorders. Foi example, a cDNA 
encoding the fibrillin-like protein, may be useful in gene therapy, and the fibrillin-like protein 
may be useful when administered to a subject ^jra ne^ th^rrof^ TTie poyel nucleic acid 
encoding AMF1 protein^ or fragments, thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acidwthe protein are to be ^ssessedr These 
materials are ftuAer useful in the; generation of antibodies that bind knmunospecifically to the 
novel substaraes of the m^ 

The AMFX nucleic acids and-prpteins are useful in potential diagnostic and therapeutic 
applications implicated in. various diseases anddi^rd^ other „ 

pathologies. For example, the con^ have efficacy for 

treatment of patients suffering from: colon, gastric, and ovarian cancer, and other diseases, 
disorders and conditions of the like. By way of nonlimiting example, the; compositions of the 
present invention will have efficacy for treatment of patients suffering from colon, gastric, and 
ovarian cancer. Additional AMF-related diseases and disorders are mentioned throughout the 
Specification. ! 

Further, the protein similarity information, expression pattern, and map location for 
AMF 1 suggests that AMF 1 may have important structural and/or physiological functions 
characteristic of the AMF family. Therefore; the nucleic acids and proteins of the invention are 
useiful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific of selective nucleic acid ot protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
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potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 
5 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the novel AMF1 substances for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described iathp "Anti-AMFX. Antibodies" section 
below. In various embodiments, contemplated AMF1 epitopes are hydrophilic regions of the 
10 AMF1 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 
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AMF-2 (also referred to as Acc. No. 20421338) 

Novel AMF2 is a nephrin-like protein. The AMF2 clone i§ alternatively referred to 
herein as Acc No. 20421 338. The AMF2 nucleic acid (SEQ ID NO:3) of 379- nucleotides is 
1 5 shown in Table 2 A. In one embodiment, the AMF2 construct is an internal fragment of a 
larger gene, wherein it is contemplated that the ORF extends beyond the N- and C-termini 
depicted in Tables 2A and 2B. As shown in Table 2A, the first coding triplet beginning at 
position 1 is in bold letters. 

Table 2A. AMF2 nucleotide sequence (SEQ ID NO:3). 

20 GGAGGGCCTGTGATTCTACTGCA.GGCAGGCACCCCCCACAACC 
CC^CCATCATCTGGTTCCGGGACGGGACGCAG^ 

GAGGGAGACCACCGTGAGCCAACTGCTTATTAACCCCACGGACCTGGACATAGGGCGTGTCTTCACTTGCCGAAG^ 
ATGAACGAAGCCATCCCTAGTGGCAAGGAGACTTCCATCGAGCTGGATGT 
CCATTGAGCCACAGACGGGGC^GGAGGGTGAGCGTGTTGTCTTTACCTGCCAGGCC^ 



The encoded AMF2 protein (SEQ ID NO:4) of 126 amino acids (SEQ ID NO:4) is 
shown in Table 2B. 

Table 2B. AMF2 amino acid sequence (SEQ ID NO:4). 

GGPVILLQAGTPHNLTCRAFNAKJPAATII^ 

MMEAIPSGKETSIELDVHHPPTVTLSIEPQTGQEGERVVFTCQATANPEI 



In an analysis of public nucleic acid sequence databases, it was found, for example, that 
the AMF2 nucleic acid sequence has 1 62 of 1 63 bases (99%) identical to a Homo sapiens 
cDNA FLJ12646 fis, clone NT2RM400 1987,. weakly similar to Neural Cell Adhesion . 
35 Molecule 1, Large Isoform Precursor (GenBank Acc. No. AK022708) (SEQ ID NO:64) shown 
in Table 2C. 
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Table 2C. BLASTN alignment of AMF2 against NT2RM4001987 (SEQ ID NO:64) 

>AK0227 08 AK0227Q8 Homo sapiens cDNA FLJ12646 fis, clone NT2RM4001987 , weakly 
similar to NEURAL CELL ADHESION MOLECULE 1, LARGE ISOFORM PRECURSOR- 9/2000 Lenqth 
» 2656 v 
5 Score = 315 bits (159), Expect = 9e-84 

Identities = 162/163 (99%) 

Strand = Plus / Plus 

Query: 217 acttgccgaagcatgaacgaagccatccctagtggcaaggagacttccatcgagctggat 276 

10 w : - 1 1 1 1 1 4i i 1 1 1 1 1 c i it 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 { 1 1 1 1 1 { 1 1 1 1 1 1 1 1 1 

Sbjct: 1 acttgccgaagcatgaacgaagccatccctagtggcaaggagacttccatcgagctggat 60 
Query: 277 gtgcaccaccctcctacagtgaccctgtccattgagccacagacggggcaggagggtgag 336 

II 1 1 II II INI II II II II II II IIIIIIIHMI IMMMI II IIIIIIIIIIIII 

ID Sbjct: 61 : gto;caccaccctcctacagtga . . 
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Query : . 337. cgtgttgtctttacctgccaggccacagccaaccccqagatct: 579- 

ii t i ii 1 1 1 1 i r 1 1 ri ii 1 1 1 1 1 1 1 1 1 1 1 1 1 # # r i r i f r r t if ■ - 

Sbjct:. 1?1. cgtgttgtctttacctg^ \.;(SEQ ID NO: 64) 

A BLASTP search was performed against public protein databases. As shown in Table 
2D, the AMF2 protein Jttas 36 of. .120 amhip. aoid re$idye§ and 54 of 1 20 

; residues (45 %) po?itiyet wit^ Ae r ^ 
(fruit fly) neuro^ 

25 Table %B. BLASTT pf AM Neuromusculin (SEQ ID NO:65) 

>Q24273 Q24273 drosophila melanogaster (fruit fly) . neuromusculin. 5/1999 

•: : Lengtnu loaiv o: v - ^v ;i; ; : ^ . i . „ : . . -;: . - : . 

Score = 55.8 bits (132), Expect = 9e-08 
^ Identities =- 36/120 , (3 0%) ; Positives « 54/120 (45%), Gaps = 10/120 (8%) 

Query! 15 - LTCRAFNAKPAATTTWFR- — - - - DGTQQ EGAVASTELLKDGKRETTVSQLLINPTDLD I 68 

III l + M +1+ I + II II 1 1*1+ I ! + 

Sbjct: 282 LTCD IHGARPAVNLTWYNTTTI I S SGENEITEVRSKSI1EKSDGTFHTQSKI1I FNATRFEN 341 

35 Query: 69 GRVCTCE^MNEAIPSG^ 124 

■:. |U | + + , + |, _ M+. kt:^ll-^+l ; t - I |.+ ; | + ||| 

Sbjct: 342 DRVFRraAENlVEQINI^OTISSALTLEVLYPPV^^ 401 

AMF2 also has hi^bt homology 30 of 114 amino acids (26%) identical and 59 of 1 14 
40 amino acids (51%) positive with the 8621 amino acid protein Mus musculus (mouse) b-cell 
receptor cd22 precursor (teu-14) (b-lymphocyte cell adhesion molecule) (bl-cam) (Acc. No. 
P35329)(SEQ ID NO:66). Table 2E. 

d ; . Table 2E BLASTP of AMF2 against CD22 (SEQ ID NO:66) 

>CD22 MOUSE P35329 mus musculus (mouse) . b-cell receptor cd22 precursor (leu -14) (b- 
45 lymphocyte cell adbaeion molecule). (bl-cam) : . 
7/1999 

- Length = 862' " - / * " 

Score a 51.5 bits (121), Expect = 2e-06 

Identities- ■ 30-/114 (26%) > Positives" : i 59/114 (51%), Gaps - 13/114 (11%) 



50 



Query: 15~ LTC3iAFNEAK^ 71 

+ | | | ++ | +M + M | ++ ++| - +I + I+++ | + 

Sbjct: 270 MTCRWSSNPKI^TvAVSWPKr^PLED---- Q^LEQEQQMSKIiIIiHSVTKDMRGK 321 
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Query: 72 FTCRSMNEAIPSGKETSIELDVHHPPTVT-LSIEPQTGQEGERWFTCQATANP 124 

+ I++ 1 + 11+ +1 I I 1+ I + + | I +1 1+ I |++ |-+| 
Sbjct: 322 YRCQASNDIGP-GESEEVELTVHYAPEPSRVHIYPSPAEEGQSVELICESLASP 374 

AMF2 also has high homology to other amino acid sequences shown in the BLASTP 
alignment data in Table 2F. 

Table 2F. BLASTP alignments of AMF2 

BLASTP 



Score E 

Value 



i i% acore 

1U Sequences producing significant alignments: (bits) vaj .^ 

Q24273 Q24273 drosophila. melanogaster (fruit fly), neurorausc... .56 9e-08 

CD22_ MOUSE P35329 mus musculus. (mouse), b-cell receptor cd22. .. 52 2e-06 

097174 097174 drosophila melanogaster (fruit fly), eg: 163al0 . : . 50 Se-06 

Q9Z2H8 Q9z2h8 mus musculus. (mouse), immunosuperf amily protei... 49 le-05 



The presence of identifiable domains in AMF1, as well as all other AMFX proteins, can 
be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interpro! 

20 DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 

Reverse Position Specific BLAST analyses. This BLAST analysis software samples domains 
found in the Smart and Pfam collections. 

Expression information for AMF2 RNA was derived using tissue sources including, but 
not limited to* proprietary database sources^ public EST sources, literature sources, and/or 

25 RACE sources, as described in the Exa. pies. / MF2 is expressed in at least the following 
tissues: fetal kidney and several cell lines derived from renal cell carcinomas. It is also 
upregulated in brain tumor and melanoma derived cell lines. 

The nucleic acids and proteins of AMF1 are useful in potential therapeutic applications 
implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 

30 encoding the nephrin-like protein may be useful in gene therapy, and the nephrin-like protein 
may be useful when administered to a subject in need thereof. The novel nucleic acid 
encoding AMF2 protein, or fragments thereof, may further be useftd in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. These 
materials are further useful in the generation of antibodies that bind immunospecifically to the 

35 novel substances of the invention for use in therapeutic or diagnostic methods. 

The AMF2 nucleic acids and proteins are useful in potential diagnostic and therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: reuial cell carcinoma, brain tumors, melanoma, congenital 
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nephritic syndrome of Finnish type and other diseases, disorders and conditions of the like. By 
way of nonlimiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from renal cell carcinoma, brain tumors, melanoma, congenital 
nephritic syndrome of Finnish type. Additional AMF2-related diseases and disorders are 
5 mentioned throughout the Specification; 

Further, the protein similarity information, expression pattern, and map location for 
AMF2 suggests that AMF2 may have important structural and/or physiological functions 
characteristic of the AMF 'family.- Therefore, the nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications and as a research tool. These include 

1 0 serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following; (i) aprotein therapeutic,, (ii) a small - 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targetirig/cytptoxic 
antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 

15 composition promoting tissue regeneration in vitro and: /n vm> (vi):biological defense weapon. 
These materials are: further useful in the genemtion of antibodies thatbind 
immunospecifically to the novel AMF2 substances for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-AMFX Antibodies" section 

20 below. In various embodiments^xoiitemplated AMF2 epitopes are hydrophilic regions of the 
AMF2 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 

AMF-3 (also referred to as Acc. No. 27251385) 

Novel AMF3 is a fibrillin-like protein related to the gene. The AMF3 clone is 
25 alternatively referred to herein as Acc. No. 27251385. The AMF3 nucleic acid (SEQ ID NO:5) 
of 3374 nucleotides is shown in Table3A. The AMF3 open reading frame ("ORF") begins at 
nucleotides 3-5. The AMF3 ORF terminates at a TAG codon at nucleotides 3357-3359. AMF3 
appears to be a C-teiminal fragment, so it is contemplated that the ORF extends beyond the 
depicted N-terminus, As shown in Table 3 A, putative untranslated regions 5 ' to the start 
30 codon and 3 ' to the stop codon are underlined, and the first coding triplet and stop codon are in 
bold letters. 

Table 3A. AMF3 nucleotide sequence (SEQ ID NO: 5). 

GCCAGTCGAGGCAGCTGCGTCAACATGGT 

CAGCGCCGCATGTGAAGACTACCGGGCCGGCGCCTGCTTCTCAGTGCTTTTCGGGGGCCGCTGTGCTGGAGACCTC 
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GCCGGCCACTACACTCGCAGGCAGTGCTGCTGTGACAGGGGCAGGTGCTGGGCAGCTGGCCCGGTCCCTGAGCTGT 
GTCCTCCTCGGGGCTCCAATGAATTCCAGCAACTGTGCGCCCAGCGGCTGCCGCTGCTACCCGGCCACCCTGGCCT 
CTTCCCTGGCCTCCTGGGCTTCGGATCCAATGGCATGGGTCCCCCTCTTGGGCCAGCGCGACTCAACCCCCATGGC 
TCTGATGCGCGTGGGATCCCCAGCCTGGGCCCTGGCAACTCTAATATTGGCACTGCTACCCTGAACCAGACCATTG 
ACATOTGCCGACACTTCACCAACCTGTGTCTGAATGGCCGCTGCCTGCCCACGCCTTCCAGCTACCGCTGCGAGTG 
TAACGTGGGCTACACCCAGGACGTGCGCGGCGAGTGCATTGATGTAGACGAATGCACCAGCAGCCCCTGCCACCAC 
GGTGACTGCGTCAACATCCCCGGCACCTACCACTGCCGGTGCTACCCGGGCTTCCAGGCCACGCCCACCAGGCAGG 
CATGCGTGGATGTGGACGAGTGCA-fTGTCAGTGGTGGCCTTTGTCACCTGGGCCGCTGTGTCAACACAGAGGGCAG 
CTTCCAGTGTGTCTGCAATGCAGGCTTCGAGCTCAGCCCTGACGGCAAGAACTGTGTGGACCACAACGAGTGTGGC 
ACCAGCACCATGTGCGTCAACGGCGTGTGTCTCAACGAGGATGGCAGCTTCTCCTGCCTCTGCAAACCCGGCTTCC 
TGCTGGCGCCTGGCGGCCACTACTGCATGGACATTGACGAGTGCCAGACGCCCGGCATCTGCGTGAACGGCCACTG 
TACG?UICACCGAGGGCTCCTTCCGCTGCGAGTGCOTGGGGGG 

GACACCCACGTGCGCAGCACCTGCTATGGGGCCATCGAGAAGGGCTCCTGTGCCCGCCCCTTCCCTGGCACTGTCA 
CCAAGTCGGAGTGCT(^TGTGCC^VA.TCCGGACGACGGTTTTGGGGAGCCCTGCCAGCTTT 

CGCTGAGTTCCAGGCACTGTGCAGCAGTGGGCTTGGCATTACCACGGATGGTCGAGACATCAACGAGTGTGCTCTG 
GATCCTGAGGTTTGTGCCAATGGCGTGTGCGAGAACCTTCGGGGCAGCTACCGCTGTGTCTGCAACCTGGGTTATG 
AGGCAGGTGCCTCAGGCAAGGACTGCACAGACGTGGATGAGTGTGCCCTCAACAGCCTCCTGTGTGACAACGGGTG 
GTGCCAGAATAGCCCTGGCAGCTACAGCTGCTCCTGCCCCCCCGG 

AAAGATGTCGACGAATGCCTGTCCAGCCCGTGTGTGAGTGGCGTTTGTCGGAACCTGGCCGGCTCCTACACCTGCA 
20 AATGTGGCCCTGGCAGCCGGCTGGACCCCTCTGGTACCTTCTGTCTAGACAGCACCAAGGGCACCTGC 

GATCGAGGAGAGCCGCTGTGAGGTGAACCT.TCAGGGAGCCAGCCTGCGGTCTGAGTGCTGTGCCACCCTCGGGGCA 
GCCTGGGGGAGCCCCTGCGAACGCTGCGAGATCGACCCTGCCTGTGCCCGGGGCTTTGCCCGGATGACGGGTGTCA 

CCTGCGATGATGTGAACGAGTGTGAGTCCTTCCCGGGAGTCTGTCCCAACGGGCGTTGCGTCAAC^CTGCTGGGTC 
TTTCCGCTGTGAGTGTCCAGAGGGCCTGATGCT^ 

25 TGTTTCCTGCGATGGGATGAGGATGAGTGTGGGGTCACCCTGCCTGGCAAGTACCGGATGGACGTCTGCTGCTGCT 
CCATCGGGGCCGTGTGGGGAGTCGAGTGCGAGGCCTGCCCGGATCCCGAGTCTCTGGAGTTCGCCAGCCTGTGCCC 
GCGGGGGCl^GGCTTCGCC^GCCGGGACTTCCTGTCTGGCCGACCATTCTATAAAGATGTGAATGAATGCAAGGTG 
TTCCCTGGCCTCTGCACGCACGGTACCTGCAGAAACACGGTGGGCAGCTTCCACTGCGCCTGTGCGGGGGGCTTCG 
CCCTGGATGCCCAOTAACGGAACTGCACAGATATCGACGAGTGTCGCAtCTCTCCTGACCTm 
CTGTGTCAACACGCCGGGCAGCTTTGAGTGCGAGTGTTTTCCCGGCTACGAGAGTGGCrrTCATGCTGATGAAGAAC 
TGCATGGACGTGGACGAGTGTGCAAGGGACCCGCTGCTCTGCCGGGGAGGCACTTGCACCAACACGGATGGGAGCT 
ACAAGTGCCAGTGTCCCCCTGGGCATGAGCTGACGGCCAAGGGCACTGCCTGTGAGGACATCGATGAGTGCTCCCT 
GAGTGATGGCCTGTGTCCCCATGGCCAGTGTGTCAATGTCATCGGTGCCTTCCAGTGCTCCTGCCATGCCGGCTTC 
CAGAGCACACCTGACCGCC^GGGOTGCGTGGACATCAACGAATGCGGGGTCCAGAATGGTGGGTGTGACG 
35 GTATTAACACTGAGGGCAGCTACCGGTGCAGCTGTGGGCAGGGCTACTCGCTGATGCCCGACGGAAGGGCATGTGC 
AGACGTGGACGAGTGTGAAGAGA^CCCCCGCGTTTGTGACCAAGGCCACTGCACCAACATGCC^IGGGGGTCACCGC 
TGCCTGTGCTATGATGGCTTCATCaGCCACGCC^iGACATGAGGACATGTGTTGATGTGGATGAGTGTGACCTGAACC 
CTCACATCTGCCTCCATGGGGACTGCGAGAACACGAAGGGTTCCTTTGTCTGCCACTGTCAGCTGGGCTA 
CAGGAAGGGGGCCACAGGCTGCTCTGATGTGGATGAATGCGAGGTTGGAGGACACAACTGTGACAGTCACGCCTCC 
40 TGTCTCT^CATCCCGGGGAGTTTC^GCTCTAGGTGC^ 

TGC^TGAATGCGTCTCC<^GGAGCACCGGTGC71GCCCAAGAGGTGACTGTCTCAATGTCCCTGGCTCCTACCGCTG 

CACCTGCCGCCAGGGCTTTGCCGGGGATGGCTTCTTCTGCGAAGACAGGGATGAATGTGCCGAGAACGTGGACCTC 
TGTGACAACGGGTA GTGCCTCAATGCGCCC 

45 The encoded AMF3 protein (SEQ ID NO:6) of 1 1 1 8 amino acids (SEQ ID NO:6) is 

shown in Table 3B. 

Table 3B. AMF3 amimo acid sequence (SEQ ID NO:6) 

QGGS CVNMVGS FHCRCFVGHRLSDSS AACEDYRAGACFS VLFGGRCAGDLAGHY^ 

P PRGSNEFQQIiCAQRLPIiLPGHPGLFPGIjIiGFGSNGMG P PLGPARIjNPHGS DARGI PS LGPGNSNT GTATLNQT ID 
ICRHFTNIiCLNGRCLPTPSSYRCECNVGYTQDVRGECIDVDECT^ 
CVDVDECIVSGGLCHLGRCVOTEGSFQCVCNAGFELSPDGKNCTOHNECATSTMCW 

LAPGGHYCMDIDECQTPGI CVNGHCTNTEGS FRCQCLGGIJVVGTDGRVCVDTHVRSTCYGAIEKGS CARPFPGTVT 
KSECCCANPDHGTOEPCQLCPAKKSAEFQAIjCSSGLGITTDGRDINEGAIj^ 

AGASGKDCTDVDECALNSLLCDNGWCQNSPGSYSCSCPPGFHFWQDTEICKDVDECIjSSPCVSGV 

cgpgsrldpsgtfcldstkgtcwi^iqesrcevnlqgaslrseccati^ 
cddvnecesfpgvcpngrcvntagsfrcecpegiimldasgrij 

igawgveceacpdpeslefaslcprgixsfasrdflsgrpfykdvneckvfpglcthgtcrnt^ 
i^aqernctdidecrispdlcxk^tcvntpgsfec^ 

kcqcppgheltakgtacedidecslsix3lcphgqcvnvigafqcschagfqstpdrqgcv/dinecrv 
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INTEGSYRCSCGQGYSLMPDGRAC^VDVDECEENPRV^ 

H I CLHGDCENTKGS FVCHCQLG YMVRKG ATG C S D VDE CEVGGHNCD S HAS CLNI PGS FSCRCLPGWVGDGFECHDL 
DECVSQEHRCSPRGDCIjNVPGSYRCTCRQGFAGDGFFCEDRDECAENVDLCDNG 

5 In an analy sis of public nucleic acid sequence databases, it was found, for example, that 

a fragment of the AMF3 nucleic acid sequence has 1 34 of 1 34 bases (1 00%) identical to a 
Homo sapiens cDNA FLJ20029 fis, clone ADSE02022 (GenBank Acc. No. AK000036) (SEQ 
ID NO:67) shown in Table 3C. 

Table 3G. BLASTN of AMF3 against FLJ20029 (SEQ ID NO:67) 

10 >AK000036 AK00 0036 Homo sapiens cDNA FLJ20029 £is, v clone ADSE02022 . 2/2000 
Length = 1399;, Strand = Plus / PTus~ " " 
Score - 266 bits (134)., Expect = 7e-68 . .. , _ . 

Identities 134/134" (100%) "■■ " ' " '' : " - ^ - 
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30 



Query: 


2306 


Sbjct: 


190 


Query : 


2366 


Sbjct: 


250 


Query: 


' 2426 


Sbjct: 


310 



1 1 II III 1 1 J I III 1,1 1 II 1 1 II II 1,1 1 1 1 III J 1 1 1 II II I II 1 1 I.I 1 1 1 1 1 1 1 III II 



20 IIMMIIIIIIIIIIII IIIINMIIIIIINIIIIIIIIIIIM Mil lllllllll 

cacgccgggcagctttgagtgcgagtgttttcccggctacgagagtggcttcatgctgat 3 09 



II II III I Ml III.. - : ,-,:r - ... ■ -~ 

gaagaactgcatgg-323 - - T ; 

In addition, the AMF3 nucleic acid sequence has high homology to other nucleic acid 
sequences, as shown in BLASTN alignment data in Table 3D. 2 

Table 3D. BLASTN alignment results for AMF3 ^ 



Score 


E 


(bits) 


Value 


266 


, 7e-68 


125 


2e-25 


109 


ie-20 


98 


4e-17 


9 98 


4e-l7 


90 


9e-l5 


90 


9e-l5 



Sequences producing significant alignments 

AK000036 AK0O0036 Homo sapiens cDNA FLJ20029 fis, clone ADSE. 
AF135060 AF13S060 Rattus norvegicus ibrillin-2 mRNA, t complex 
MUS FBN2 L3 9 7 9 0 Mus mus cuius f ibr ill in : 2' ( fbn2 ) ' gene , comp let . 
35 HSU03272 U03272 Human fibrillin- 2 mRNA, complete cds . 6/1994 
HSFIB5 X62009 Homo -sapiens partial mRNA for fibrillin. 5; "9/19 
AC025169 AC025169 Homo sapiens" chromosome 5 clone CTC-352M6 , . 
AC0 10461 AC0104 6 1 Homo sapiens t chromosome 5 clone CTDr2275A5 . 

40 A BLASTP search was performed against public protein databases. As shown in Table 

3E, the AMF3 protein has 766 of 1 178 amino acid residues (65 %) identical to, and 913 of 
1178 amino acid residueis (77 %) positive with, the 2911 amino acid residue long Homo 
sapiens (human), fibrillin 2 precursor (Acc. No. P35556 ) (SEQ ID NO:68). 

Table 3E. BLASTP of AMF3 against FBN2 (SEQ ID NO:68) 

45 >FBN2_H0MAW P35556 homo sapiens, (human) . fibrillin 2 precursor.. Xl/1997 
Length - 2911 . .. 
Score » 1804 bits (4622), Expect - 0.0 

Identities =. 766/1178 (65%), Positives = 913/1178 (77%), Gaps = 62/1178 (5%) 

50 Query: 1 QGGS CVNMVGS FHCRCP VGHRLSDS S AACE ---- 30 

- MM + I lilt III! l> ■ II 

Sb j c t : 2 87 QGGNCINTVGS FECRCPAGHKQSETTQKCEDIDECS 1 1 PGICETGECSNTVGSYFCVCPR 346 
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Query: 31 DYRAGACFSVLFGGRCAGDLAGHYTRRQCCCDRGRCWAAGPVPELCPP 78 

I M III I 111) +| | | + | | I 1+ llil I +1 I M 
Sbjct: 347 G YVTS TDGS RC I DQRTGMC FS GLVNGRCAQE LPGRMTKMQCCCEPGRCWG I GT I PEAC P V 406 

5 Query: 79 RGSNEFQQLCAQRLPL- -LPGHPGLFPGLLGFGSNGMGPPLGPARLNPHGSDARGIP 13 3 

III I+++II 11+ +11 I II I Ml I 11+ || 

Sbjct: 407 RG S E E YRRLCMDGLPMGG I PGS AGS R PG - - GTGGNG FAP S GNGNGYG PGGTG F I P I PGGN 464 

Query: 134 --SLGPGNSNIGT ATLNQTIDICRHFTNLCLNGRCLPTPSSYRCECNVGY 181 

10 " I I I + +1 llllllil+l llllllll+ll llllllll+ll 

Sbjct: 4 6S GFS PGVGGAGVGAGGQG P 1 1 TGLT I LNQT I D I CKHHANLCLNGRC I P TVS S YR CE CNMG Y 524 

Query,: 182 TQDVRGECIDVDECTSSPCHHGDCVNIPGTYHCRCYPGFQATPTRQACVDVDECIVSGGL 241 

K 1 1 l + IIMIIIM + ll +IIIII II + I + I + I+ III NI + IM + I + IMI +1 I 

ID Sbjct: 525 KQDANGDCIDVDECTSNPC™GDC\^ 584 

Query: 24 2 CHLGRCVNTEGSFQCTCNAGFELSPDGKNCVDHNECM FS CLCKP 3 01 

I MMI++IMM + MIIIII + llllllll+ll 1+ M + M + I + IIMM l + MI 
20 Sfc>:)Ct: 585 CKNGRCVNSDGS FQCI CNAGFELTTDGKN CVDHpECTTTNMCLNGMC I NEDG S F KC I CKP 644 

Query: 302 GFLLAPGGHYCMD IDECQTPGI CVNGHCTNTEGS FRCQCLGGIiAVGTDGRVCVDTHVRST 361 

m+iii i M i+miiiin+im i+i i rni i inn riiiiiiM+Mi 

Sbj Ct : 645 GFVLAPNGRYCHIJVDEC^TPGI CMNGHCINSEGS FRCDCPPGLiAVGMDGRVCVDTHMRST -704 
25 Query: 362 CYGAI E KGS CARP FPGTVT ICS ECCCANP DHG FGEP CQLCPAKNS AEFQALCS S G LG I TTD 421 

Ml l + ll I 1IIM II 1 1 1 II IIMM III III 1 1 1 1 1 1 1 1 1 iiiii + ni I 

Sbjct: 705 CYGGIKKGVCVRPFPGAVT^ECCCaNPDYGFGEP.CQPCPAKNSAEraGLCSjSGVGITTO 764 . 
Query: 422 GRD INECALD PEVCANGVCENLRGS YRCV CNLG YEAGAS GKDCTDVDE CALNS LL CDNGW 481 

30 ~ iii!iiriiii++iiii+iiiiMiiii ii iii MI++I- i+m +1 ruin 

Sbjct: 765 GRDINEC^UIiDPDIC^^GICENIjRGSYRCNC^SGYEPDASGRNCIDIDECLVNRIiIiCDNGIj 824 
Query.: 4 82 CQNS PGS YS CS CPPGFHFWQDTEI CKDVDECLSS PCVSGVCRNLAGS YTCKCX3 PGSRLDP 541 

„ i + i+i 1 1 1 1 i+i 1 1 r+ i +i r 1+1++1 1 i+rii+i in ii+i+mm+i 

3D Sbjct: 825 CRNTPGS YS CTCPPGYVFRTETETCED INECESNPCVNGACRNNLGS FNCE CS PGS KLS S 884 
Query: 54 2 SGTFCI^STKGTCWLKIQESRCEVNLQGASI^SECC^TLGAAWGSPCERCEIDPACARGF 601 

+ i i+ii nun II+IMIM+ ii+i+imiiiiiiimiiiii+i III) 

Sbjct: 885 TGLICIDSLKGTO^LNIQDSRCEVNINGATLKSECCATLGAAWGSPCERCEIjDTACPRGL 944 

40 

Query: 6 02 ARMTGVTCDDWECESFPGVCPNGRCVNTAGSFRCECPEGLMIJDASGRLCVDVRLEPCFL 661 

M+ I II I + I III II I INI III II 11+ III 11 II III II +II + I + 1+T+I l + l 

Sbjct: 945 ARI KGVTCEDVNECEVFPGVCPNGRCVNSKGS FHCECPEGIjTLDGTGRVCLDIRMEQCYL 1004 
45 Query: 662 RWDEDECGVTLPGKYRMDVCCCSIGAVWGVECEACPDPESLEFASLCPRGLGFT^SR-DFL 72 0 

+ 111 III +11 l + l M i II ++-I-I. II III II. 1 + 1+ +|| Ml M.l + I I I 

Sbjct: 1005 KWDEDECIHPVPGKFRMDACCCAVGAAWGTECEECPKPGTKEYETLCPRGAGFANRGDVIj 1064 
Query: 721 SGRPFYKDVNECKVFPGLCTHGTCRNTVGS FHCACAGGFALDAQERNCTD IDECR I S PDL 780 

50 _ +MIIIM+IIII iii+ii+i ini+iii i i inn +iiimiiiiimii 

Sbjct: 1065 TGRPFYKDINECKAFPGMCTYGKCRNTIGSFKCRCNSGFALDMEERNCTDIDECRISPDL 1124 
Query: 781 CXKJGTCVNTPGSFECECFPGYESGFMLMKNCTIDVDEC^^ 840 

„ IN MMMMIMM lllllll + IMIM + l I l + lllllllll II + M++I I 

->-> Sbjct: 1125 CGSGICV1WPGSFECECFBGYESGFMMMKNC^^IDGCERNPLLCRGGTCVNTEGSFQCDC 1184 

Query: 841 PPGHELTAKGTACEDIDECSLSDGLCPHGQCVNVIGAFQCSCHAGFQSTPDRQGCVDINE 900 

I I I I 1+ I I l + l I I I I I II +I + III + II +1111+ l+l + lllllll ll + l 

Sbjct: 1185 PLGHELS PSRED CVDINE CS LS DNLCRNGKCVNM IGTYQCS CNPG YQATPDRQGCTD IDE 1244 

60 

Query: 901 CRVQNGGCDVHRI NTEGS YRC S CGQGYS LMPDGRACADVDE CEENPRVCDQGHCTNMPGG 960 

' I + Mill l + llll Ml +II + IIIMI + III + IMI II +11 I 1 1 1 + II 

Sbjct: 1245 CMIMNGGCHDTQCTNSEGSYECSCSEGYALMPDGRSC^DIDECEKNPDICDGGQCTNIPGE 1304 
65 Query: 961 HRCLCYDGFMATPDMRTCVDVT>ECDLNPHICIjHGDCENTK^ 102 0 

+iiiiimn+ ii+M+M+imi +ii+ i+iiiiiiii+iiiiiii i+ii iii 

Sbjct: 13 05 YRCL CYDGFMAS MDMKTC I DVNE CDLNSN I CMFGE CENTKGS F I CHCQLG YS VKKGTTGC 1364 
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Query: 1021 S DVDECE VGGHNCDSHAS CLN I PGS FS CRCLPG WVGDGFE CHDLDE CVSQEHRCS PRGDC 1080 

+ IIMII+I- mi 1 1 1 1 1 1 1 1 1 ii i i ii+i+i *i inn + i + n i 

Sbjct: 1365 TDVDECEIGAHNOTMHASCLNIPGSFKCS(^EGWIGNGIKCIDLDECSNGTHQCSINAQC 1424 
5 Query: 1081 LNVPGSYRCTCRQGFAGDGFFCEDRDECAENVDLCDNG 1118 

+i nun i +ii. \\\\ i t 111111++11+11 

Sbjct: 1425 VNTPGSYRCACSEGFTGDGFTCSDVDECAENINLCENG 1462 

AMF3 also has high homology to other amino acid sequences, as shown in BLASTP 
10 alignment data.shown in Table 3F. 

Table 3F. BLASTP alignment results for AMF3 

* - Score E 
Sequences producing significant alignments: (bits) Value 

FBN2_HUMAN P35556 homo sapiens- (human) . fibrillin 2 precurso. . . 1804 0.0 

15 FBN2_MOUSE Q61555 mus musculus (mouse), fibrillin 2 precurso... 1802 0.0 

OB8840 O88840 mus musculus (mouse), mutant fibrillin-l. 5/1999 1596 0.0 

FBN1JBOVIN P98133 bos taurus (bovine), fibrillin 1 precursor. . . 1594 0.0 

FBN1_HUMAN P35555 homo sapiens (human) . fibrillin 1 precurso. . . 1591 " 0.0 

FBN1 MOUSE Q6 1554 mus musculus (mouse), fibrillin 1 precurso... 1590 0.0 

20 Q60784 Q60784 mus musculus "(mouse) ." fibrillin-l^ (fragment) .... 1108 0.0 

P87363 P87363 gallus gallus (chicken), fibrillin-l (fragment... 713 0.0 

Q 6 0789 Q60789 mus musculus 1 * ( mous e ) . , f ibr ill in- 2 ; ( f ragmen t ) . . 534 e - 1 5 0 

The presence of identifiable domains in AMF3, as well as all other AMFX proteins, can 

25 be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomaih, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interproV 
DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 
Reverse Position Specific BLAST analyses. This BLAST analysis software samples domains 

30 found in the Smart and Pfam collections. 

Expression information for AMFX RNA was derived using tissue sources including, 
but not limited to, proprietary database sources, public EST sources, literature sources, and/or 
RACE sources, as described in the Examples. AMF3 is expressed in at least the following 
tissues: colon and gastric cancers. Highest expression is lung cancer cell lines and this 

35 correlates with expression in fetal lung, indicating an oncofetal phenotype. 

The nucleic acids and proteins of AMF3 are useful in potential therapeutic applications 
implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 
encoding the fibrillin-like protein may be useful in gene therapy, and the fibrillin-like protein 
may be useful when administered to a subject in need thereof. The novel nucleic acid 

40 encoding AMF3 protein, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. These 
materials are further useful in the generation of antibodies that bind immunospecifically to the 
novel substances of the invention for use in therapeutic or diagnostic methods. 
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The AMF3 nucleic acids and proteins are useful in potential diagnostic and therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: Marfan syndrome, congenital contractural 
arachnodactyly, Marfan-like habitus, familial adenomatous polyposis and other diseases, 
disorders and conditions of the like. By way of nonlimiting example, the compositions of the 
present invention will have efficacy for treatment of patients suffering from Marfan syndrome, 
congenital contractural arachnodactyly, Marfan-like habitus, familial adenomatous polyposis. 
Additional AMF3 -related diseases and disorders are mentioned throughout the Specification. 

Further,- the protein similarity information* expression pattern, and map location for 
AMF3* sB^g§5t^ T that AMF3 may have important structural and/or physiological functions 
characteristic of the r AMF family.- Therefore, the nucleic acids and proteins of the invention are 
useful in f&tential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drag target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), (iv) a nucleic acid useful in gene; therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecifieally to the novel AMF3 substances for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-AMFX Antibodies" section 
below. In various embodiments, contemplated AMF3 epitopes are hydrophilic regions of the 
AMF3 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 

AMF-4 (also referred to as Acc No. 27486474) 

Novel AMF4 is a plasminogen-like protein. The AMF4 clone is alternatively referred 
to herein as Acc. No. 27486474. . The AME4 nucleic acid of 439 nucleotides is shown in Table 
4 A. The AMF4 open reading frame ("ORF") begins at positions 2-5. The AMF4 ORF 
terminates at a TAA codon at nucleotides 93-95. As shown in Table 4A, putative untranslated 
regions 3' to the stop codon are underlined, and the stop codon is in bold letters. AMF4 does 
not begin at an ATG start site, so it is most likely a C-terminal coding fragment. It is 
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contemplated that the AMF4 ORF extends in the 5' direction of the nucleic acid (SEQ ID 
NO:7) and the N-terminal direction of the polypeptide (SEQ ID NO:8). 

Table 4A. AMF4 nucleic acid (SEQ ID NO:7) 

T CAC GGG AAT AAG CCT GGG CCC GTC CCT. TTG ATT TCC AAC AAG ATC 
TGC AAC CAC AGG GAC GTG TAC GGT GGC ATC ATC TCC CCC TCC ATG 
CTC TGC GCG GGC TAC CTG ACG GGT GGC GTG GAC AGC TGC CAG GGG 
GAC AGC GGG GGG CCC CTG GTG TGT CAA GAG AGG AGG CTG TGG AAG 
TTA GTG GGA GCG ACC AGC TTT GGC ATC GGC TGC GCA GAG GTG AAC 
AAG CCT GGG GTG TAC ACC GTG TGA CCT QCT TCC TGG ACT GGA TCC 
ACG AGC AGA TGG AGA GAG ACC TAA AAA CCT GAA GAG GAA GGG GAT 
AAG TAG CCA C(jT GAG TTC CTG AGG TGA TGA AGA CAG CCC GAT CCT 
CCC CTG GAC TCC CGT GTA" GGA ACC TGC ACA CGA GCA GAC ACC CTt " 
GGA GCT CTG -AGT- TCC GGC ACC AGT AGC AGG CCC : ■ : c *•- 

The encoded AMF4 polypeptide (SEQ ID NO:8) is shown using the one-letter amino 
acid code in Table 4B a ^ ■ r r : 1 ~ 

Table 4A. AMF4 polypeptide (SEQ ID NO:S) 

"llGNia?GPVPEn5^^ 
GATSFGIGCAEVNKPGVYTVSPPSWTGSTSRWRET . „ 



In an analysis of public nucleic acid sequence databases, it was found, for example, that 
the AMF4 nucleic acid sequence has 418 of 420 bases (99%) identical to a serine protease 
(GenBank Acc. No. AB038159) (SEQ ID NO:69) shown in Table 4C. In all BLAST 
alignments herein, the "E-value" or "Expect" value is a numeric indication of the probabihty 
that the aligned sequences could have achieved their similarity to the BLAST query sequence 
by chance alone, within the database that was searched. For example, as shown in Table 4C, 
the probability that the subject ("Sbjct") retrieved from the AMF4 BLAST analysis, in this 
case the serine protease gene/protein, matched the Query AMF4 sequence purely by chance is 
zero, E value 0.0. 

Table 4C. BLASTN of AMF4 against AB038159 (SEQ ID NO:69) 

>AB038159 H. sapiens TMPRSS3c mRMA for serine protease, complete cds. 1/2001 
Length = 2135 Strand = Plus / Plus 
Score = 809 bits (408), Expect =0.0 
Identities = 418/420 (99%), Gaps = 1/420 (0%): 

Query: 21 ccgtccctttgatttccaacaagatctgcaaccacagggacgtgtacggtggcatcatct 80 

1 1 III I Mill III III III 1 1 III I Mil III III I Mil lllllll 1 1 II I llllll I 

Sbjct: 950 ccgtccctttgatttccaacaagatctgcaaccacagggacgtgtacggtggcatcatct 1009 
Query: 81 ccccctccatgctctgcgcgggctacctgacgggtggcgtggacagctgccagggggaca 140 

I MINI IMIIIIIII MINIMI IIIIIMIIIIIIIIIIIIIII MM llllllll 

Sbjct: 1010 ccccctccatgctctgcgcgggctacctgacgggtggcgtggacagctgccagggggaca 1069 
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Query: 141 gcggggggcccctggtgtgtcaagagaggaggctgtggaagttagtgggagcgaccagct 2 00 

sbH t in7n IIIIIIMIIIMilllllllMMIIIIIIIIIIIMIIIIMlllllilllllllTll 

Stgct: 1070 gcggggggcccctggtgtgtcaagagaggaggctgtggaagttagtgggagcgaccagct 1129 

Query: 201 ttggcatcggctgcgcagaggtgaacaagcctggggtgtaca-ccgtgtcacctccttcc 259 

sbict wo imimiiiiiiiiiiii iiiiiini n iiiiiinmimii 

soj cc : 1130 ttggcatcggctgcgcagaggtgaacaagcctggggtgtacacccgtgtcacctccttcc 1189 

Query: 260 tggactggatccacgagcagatggagagagacctaaaaacctgaagaggaagcra-gataaq 319 

llllllMIIIIIIMIilllMlllllllllllilllllllllllllllllllll IN 

Sojct: 1190 tggactggatccacgagcagatggagagagacctaaaaacctgaagaggaaggggacaag 1249 

Query: 320 ; tagccacctgagttcctgaggtgatgaagacagcccgatcctcccctggactcccgtcTta 379 

15 sbio , iiiiMiiiiiiiniiii iiiiiiiiiiiiiiiriiiiiiiiiiiiiiiiriii 

strict: 12^0 tagccacctgagttcctgaggtgatgaagacagcccgatcctcccctggactcccgtgta 1309 

Query; 3 80 ggaacctgcacacgagcagacacccttggagctctgagttccggcaccagtagcaggccc 439 

9o „ „,« 'i 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 nnm 1 1 1 1 1 1 1 1 1 1 1 mTiu 1 nTi rTi mTi 1 1 

srrjct : 1310 ggaacctgcacacgagcagacacccttggagctctgagttccggcaccagtagcaggccc 13 69 

Additional BLASTN information for related nucleic acid sequences is shown in Table 
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Tabic 4B BLASTN analysis resulte for AME4 



Sequences producing significant alignments: ~ " " . 

AB038Z59 AB038159 Homo sapiens TMPRSS3c mRNA for serine prot . 
AB038156- AB038158. Homo sapiens, TMPRSS3b mRNA for serine prot . 
AB038157 AB038157 Homo sapiens TMPRSS3a mRNA for serine prot. 
30 AF2013 80 AF2013 80 Homo sapiens serine protease TADG12. mRNA, . . 
AP00174 6 AP001746 Homo sapiens genomic DMA, chromosome 21c, 
AP001623 AP001623 Homo sapiens, genomic DMA, chromosome 21," c. 
AC015555 AC015555 Homo sapiens chromosome 21 clone RP11-113P 



Score 


B 


(bits) 


Value 


809 


0.0 


809 


0.0 


809 


0.0 


753 


0.0 


3 01 


2e-79 


3 01 


2e-79 


3 01 


2e-79 



35 . A BLASTP search was performed against public protein databases. The results from 

this comparison are shown in Table. 4E. : 

Table 4E BLASTP analysis results for AMF4 

Sequences producing significant alignments: 
40 PLMN_PIG P06867 sus scrof a (pig) . plasminogen (ec 3 . 4 . 21 , 7) . . 

PLMN_BOVIN P06868 bos taurus (bovine) . plasminogen precursor. 

HEPS_MOUSE 035453 mus musculus (mouse) . serine protease heps. 

PLMN_HORSB P80010 equus car>allus (horse), plasminogen (ec 3.. 

PLMN_MACMU P12545 macaca mulatta (rhesus macaque) . plasminog. 
45 HEPS_RAT Q05511 rattus norvegicus (rat) . serine protease hep. 

HEPS_HUMAN P0S981 homo sapiens (human) . serine protease heps. 

PLMN_HUMAN P00747 homo sapiens (human) . plasminogen precurso. 

Q15146 . Q15146 homo sapiens (human), plasminogen precursor, i: 
50 046507 O46507 papio hamadryas (hamadryas baboon), plasminoge . 



(59 %) identical to, and 60 of 81 residues (73 %) positive with, the 790 amino acid residue 
long plasminogen from pig (Acc. No. P06867) (SEQ ID NO:70). 
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Score 


E 


(bits) 


Value 


102 


6e- 


■22 


101 


2e- 


■21 


98 


2e- 


-20 


97 


3e- 


•20 


96 


5e- 


20 


96 


5e- 


20 


. . ' 96 


5e- 


20 


96 


6e- 


20 


96 


6e- 


20 


96 


8e- 
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Table 4F. BLASTP of AMF4 against P06867 (SEQ ID NO:70) 

PLMN_PIG P06867 sus scrota (pig), plasminogen (ec 3.4.21.7). 10/1996 Length = 790 
Score » 102 bits (252), Expect ~ 6e-22 
^ Identities « 48/81 (59%), Positives = 60/81 (73%), Gaps v = 1/81 (1%) 

Query: 4 KPGPVPLISNKI OJHRDVYGGI IS PSMLCAGYLRGGVDSCQGDSGGPLVCQERRLWKLVG 63 

I + 1+1 ll + ll +11 +11+ llll+l ll+lllllllllllll 1+ +l| 
Sbjct: 697 KKARLPVIENKVCNRYEYLGGKVSPNELCAGHIiAGGIDSCQGDSGGPLVCFEKDKYILQG 756 

10 Query: 64 ARSFGIGCAEVNKPGVY-RVS 83 . . 

I+I+III llllll III 
Sbjct: 757 VTSWGLGCALPNKPGVYVRVS 777 



15 In addition, as shown in Table 4G, the AMF4 protein has 47 of 82 amino acid residues 

(57 %) identical to, and 58 of 82 residues (70 %) positive with, the 812 amino acid residue 
long bovine plasminogen precursor (Acc. No. P06868) (SEQ ED NO:71). 



Table 4G. BLASTP of AMF4 against P06868 (SEQ ID NO:71) 

20 PLMN BOVTN P06868 bos taurus plasminogen precursor (ec 3.4.21.7) 11/1997 Length = 

812 ■ . ,• •• " ■ -. .; 1 : ;• :: ••. \ ■ 

Score =* 101 bits (248) , Expect = 2e-21 

Identities » .47/82- (57%)., Positives - .58/82 (70%) , Gaps » 1/82 (1%) 

25 Query: 4 KPGPVPLISNKICNHRDVYGGIISPSMLCAGYLRGGVIXSCQGDSGGPLVCQERRLWKLVG 63 

I +1+1 li+ll + 1+1+ llll+l II lllllllllllll 1+ +11 
Sbjct: 719 K2SAHLP VIENKVaTRNEYLDGRVKPTELC&GHLI GGTDS CQGDSGGPL VCFEKDK Y I LQG 778.. 

Query: 64 ARSFGIGCAEVNKPGVY-RVSP 84 

30 l+l+lll llllll Mil 

Sbjct: 779 VTSWGLGCARPWKPGVYVRVSP 800 - 



The presence of identifiable domains in AMF4, as well as all other AMFX proteins, can 
be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 

35 Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interpro) . 
DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 
Reverse Position Specific BLAST analyses. This BLAST analysis software samples domains 
found in the Smartand Pfam collections. For this DOMAIN sequence alignments, fully 

40 conserved single residues are indicated by black shading "strong" semi-conserved residues are 
indicated by grey. The "strong" group of conserved amino acid residues may be any one of the 
following groups of amino acids: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, 
FYW. AMF4 shows good homology with the consensus sequence of the trypsin-like serine 
protease domain (Smart|Tryp_SPc, E= 2e-21) and the trypsin domain (Pfam00089, E= 2e-14). 

45 The alignment with the trypsin-like serine protease domain (SEQ ID NO:72)(labeled 
"Consensus") is shown in Table 4H. 
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TABLE 4HL DOMAIN ANALYSIS FOR AMF4 - ALIGNMENT WITH TRYPSIN- 
LIKE SERINE PROTEASE DOMAIN (SEQ ID NO:72) 

■•--|.--.|....|-...|....|....|....|.-..|.-..|....|....|....| 

Consensus RIVGGSEANIGSFPWQVSLQYRGGG-RHFCGGSLISPRWVLTAAHCVYGSD SS 52 

AMF4 ± 

„ r ' . ...,).-.. |.. ..|.... i |....|....|....|....|....|....|....| 

Consensus . IRVRLGSHDLSSGEET QTVKVSKVIVHPNVNP- -STYDNDIALLKLKKPVTLSDTVR 107" 



1- 



....|....|....|....|....|....|....|....|....|.,..|... 

Consensus PI CLP S3 G YNVP AGTTCTVSGWGRT S E SGGSLPDTLQBVKg@I\ 

AMF 4 — 1 HnwKPfiD^T.-! 



Consensus 
AMF4 



Consensus 
AMF4 



The trypsin-like serine protease domain is present in a large family of proteins, 
5 includinglnaay flKaf are synthesized as inactive precursor zymogens that are cleaved during 
limited proteolysis to generate their active forms. 

AMF4 has similarity to plasminogens. Plasmin dissolves the fibrin of blood clots and 
acts as a proteolytic factor in a variety of other processes including embryonic development, 
tissue remodeling, tumor invasion, and inflammation; in ovulation it weakens the walls of the 
1 0 graafian follicle. It activates the urokinase-type plasminogen activator, collagenases and 
several complement zymogens, such as cl and c5. it cleaves fibrin, fibronectin, 
thrombospondin, laminin and von Willebrand factor. 

Plasminogen is the zymogen in the circulating blood" from which plasmin is formed. 
Plasminogen is a single-chain glycoprotein with 790 amino acid residues. Activation to the 
1 5 active form, plasmin, by urokinase (Online Mendelian Inheritance in Man ("OMIM") Acc. No. 
191840) involves cleavage at the Arg-Val bond between residues 560 and 561, resulting in the 
formation of the 2-chain plasmin molecule held together by 2 disulfide linkages. The heavier 
chain contains about 41 1 residues and the lighter chain about 233. The main function of 
plasmin is the digestion of fibrin in blood clots. Plasmin is a proteolytic enzyme with a 
20 specificity similar to that of trypsin. Like trypsin, plasmin belongs to the family of serine 
proteinases, in which the active site catalytic triad, His-57, Asp-102, and Ser-195 
(chymotrypsin numbering), is situated in the light chain. 

The plasminogen activation system is one pathway that has been consistently 
implicated in cancer. Its relevance to cancer extends from being responsible for many of the 
25 hemorrhagic episodes that occur in cancer patients to being fundamental to many, if not all of 
the molecular mechanisms that define tumor progression. Extravasation and intravasation of 
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solid malignant tumors is controlled by attachment of tumor cells to components of the 
basement membrane and the extracellular matrix, by local proteolysis and tumor cell 
migration. Strong clinical and experimental evidence has accumulated that the tumor- 
associated serine protease plasmin, its activator uPA (urokinase-type plasminogen activator), 
5 the receptor uPA-R (CD87), and the inhibitors PAI-1 and PAI-2 are linked to cancer invasion 
and metastasis. In cancer, increase of uPA, uPA-R, and/or PAI-1 is associated with tumor 
progression and with shortened disease-free and/or overall survival in patients afflicted with 
malignant solid tumors. uPA and/or its inhibitor PAI- 1 appear to be one of the strongest 
prognostic markers so far described. Strong prognostic value to predict disease recurrence and 

10 overall survival has been documented for patients with cancer of the breast, ovary, cervix, 
endometrium, stomach, colon, lung, bladder, kidney, brain, and soft-tissue. Due to the strong 
correlation between elevated uPA and/or PAI-1 values in primary cancer tissues arid the tumor 
invasion/ metastasis capacity of cancer cells; proteolytic factors have been selected as targets 
for therapy. 7: - . • ■ - : - - 

15 A novel angiogenesis inhibitor that mediated the suppression of metastases from a 

Lewis lung carcinoma was isolated and designated the inhibitor angiostatin. See, e.g., O'Reilly 
et aL 1 994 Cell 79: 3 1 5-328. Angiostatin is a 3 S-kD internal fragment of plasminogen 
containing at least 3 of the kringles of plasminogen. Recombinant fragments of angiostatin 
show inhibitory activity in vitro. See, e.g., Cao et aL 1996 J. Clin. Invest. 101 : 1055-1063. 

20 Angiostatin is produced by the proteolytic cleavage of plasminogen by a serine protease 

produced by several human prostate carcinoma cell lines. See, e.g., Gately et aL 1 996 Cancer 
Res. 56: 4887-4890. A shift of balance of tumor angiogenesis by gene transfer of a cDNA 
coding for mouse angiostatin into murine T241 fibrosarcoma cells suppresses primary and 
metastatic tumor growth in vivo. See, e.g., Cao et al. 1998 J. Clin. Invest. 101 : 1055-1063. 

25 Implementation of stable clones expressing mouse angiostatin in C57B 1 6/J mice inhibited 

primary tumor growth by an average of 77%. After removal of primary tumors, the pulmonary 
micrometastases in approximately 70% of mice remained in a microscopic dormant and 
avascular state for 2 to 5 months. The tumor cells in the dormant micrometastases exhibited a 
high rate of apoptosis balanced by a high proliferation rate. These studies showed the 

30 diminished growth of lung metastases after removal of the primary tumor, suggesting that 
metastases are self-inhibitory by halting angiogenesis. The data may also provide a novel 
approach for cancer therapy by anti-angiogenic gene therapy with a specific angiogenesis 
inhibitor. The angiostatin-induced long-term dormancy of lung metastases was equivalent to 14 
to 15 human years (when 1 mouse day is equivalent to approximately 35 human days). 

24 

BNSDOCID: <WO 0174897A2J_>- 



WO 01/74897 



PCT/US01/10892 



Overexpression of AMF4 in concert with a plasminogen activator such as uPA 
(urokinase) could potentially stimulate tumor cell invasion and migration. Alternatively, 
AMF4 could serve as a substrate for an unidentified serine protease akin to the protease that 
cleaves plasminogen to angiostatin. In this manner, tumor cells might limit the production of 
this important anti-angiogenic factor. 

Therapeutic targeting of AMF4 is anticipated to limit or block the extent of tumor cell 
invasion/motility and metastasis. Potentially therapeutic targeting of AMF4 might shift the 
balance in favor of the production of angiostatin or a similar molecule with anti-angiogenic 
activity: . - 

Expression- information for AMFX RNA was derived using tissue sources including, 
but not limited to, proprietary database sources, public EST source^ literature sources, and/or 
RACE sources, as described in the Examples.. 

The nucleic acids and proteins of AMF4 are useful in potential therapeutic applications 
implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 
encoding the trypsin-like serine protease protein may be useful in gene therapy, and the 
trypsin-like serine protease protein may be useful when administered to a subject in need 
thereof. The novel nucleic acid encoding AMF4 protein, or fragments thereof, may fhrther be 
usefiil in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. These materials are further useful in the generation of antibodies 
that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

The AMFX nucleic acids and proteins are useful in potential diagnostic and therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: cancer, blood clotting disorders and other diseases, 
disorders and conditions of the like. By way of nonlimiting example, the compositions of the 
present invention will have efficacy for treatment of patients suffering from cancer, blood 
clotting disorders. Additional AMF-related diseases and disorders are mentioned throughout 
the Specification. 

Further, the protein similarity information, expression pattern, and map. location for 
AMF4 suggests that AMF4 may have important structural and/or physiological functions 
characteristic of the trypsin-like serine protease family. Therefore, the nucleic acids and 
proteins of the invention are usefiil in potential diagnostic and therapeutic applications and as a 
research tool. These include serving as a specific or selective nucleic acid or protein diagnostic 
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and/or prognostic marker, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed, as well as potential therapeutic applications such as the following: (i) a protein 
therapeutic, (ii) a small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, 
drug targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene 
5. delivery/gene ablation), and (v) a composition promoting tissue regeneration in vitro and in 
vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel AMF4 substances for use in therapeutic or diagnostic 
methods. These antibodies may be generatedaceording to methods known in the art, using 
10 prediction from hydrophobicity charts; as described in the "Anti-AMFX Antibodies" section 
below. In various embodiments, contemplated AMF4 epitopes are-hydrophilic regions of the 
AMF4 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 

AMF-5 (also referred to as Acc No^29691387) ^ ^ [j 

1 5 Novel AMF5 is an organic anion transporting peptide-like protein ("OTAP") protein. 

The AMF5 clone is alternatively referred to herein as Aca.No. 29691387. The AMF5 nucleic 
acid of 2646 nucleotides is shown in Table 5 A. The AMF5 open reading frame ("ORF") 
begins at nucleotides 3-5. AMF5 appears to be an internal fragment, so it is contemplated that 
the ORF could extend beyond the N- and C- termini depicted in Tables 5A and 5B. As shown 

20 in Table 5A, the first coding triplet is in bold letters. 

Table 5A. AMFS nucleotide sequence (SEQ m NO:9). 

TGT CATTGT CCTTTTACC TATTATATTTTTTCAT ACTC TGXG AAAACAi^TCAGTTGC CGGACTAA.C CATG AC CT A. 
TGATGGAAATAATCCAGTGACATCTCA^ 

AGTCAGTGGGAACCAGTCTGTGGGAACAATGGAATAACTTACC^ 
25 GTGGTATTAAAAAGCATACAGTGTTTTATAACTGTAG^^ 

AGCGCACTTGGGTGAATGCCCAAGAGATAATACTTGTACAAGOA^ 
AACTCTTTGTTCTCTG CAACAGGAGGTACC 

The encoded AMF5 protein (SEQ ED NO: 1 0) is a 1 36 amino acid protein shown in 
30 Table 5B. 

Table SB. AMFS amino acid sequence (SEQ ID NO: 10) 

SliSFYIjLYFFILCENKSVAGIjTNrrYI^^ PCLAGCKS S S 

GIKKHTVFYNCSCVEVTGMlHUnfSAHLGECPI^ 

35 In an analysis of public nucleic acid sequence databases, it was found, for example, that 

the AMF5 nucleic acid sequence has 363 of 374 bases (97%) identical to a Homo sapiens 
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nxRNA for organic anion transporter 8 (SLC21A8 gene) (GenBank Acc. No. AJ251506) (SEQ 
ID NO:73) shown in Table 5C. 

Tabic 5C BLASTN of AMF5 against OAT-8 mRNA (SEQ ID NO:73) 

>HSA251506 AJ251506 Homo sapiens mRNA for organic anion transporter 8 
(SLC21A8 gene). 7/2000 -Length = 2646; Strand = Plus / Plus 
Score = 654 bits (330) , Expect = 0.0 
Identities = 363/374 (97%) 

Query: 37 tctgtgaaaacaaatcagttgccggactaaccatgacctatgatggaaataatccaqtqa 96 

10 „ w . t mi mi 1 1 in 1 1 1 1 ii 1 1 1 1 i nil i nun urn mi mi in iiiii 



15 



20 



45 



Query : 


3 / 


faD] Ct : 


133 0 


Query: 


97 _ 


Sbjct : 


1390 


Query: 


157 


Sb j ct : 


1450 


Query: 


217 


Sbjct:. 


1510 


Query ; 


.277 


Sbjct: 


1570 


Query: 


337 


Sbjct: 


1630 


Query:. 


397 


Sbjct: 


16 90 



catctcatagagatgtgccactttcttattgcaactcagactgcaattgtgatgaaagtc 156 

iiniiii mi iiri in i ii i nil rni 1 1 inn innTiinniiiiiii 



agtgggaaccagtctgtgggaacaatggaataacttacctgtcaccttgtctagcaqgat 216 

ninirniririnniiniiirMTniirinrinriniriiiniiriiiii 



gcaaatcctcaagtggtattaaaaagcatacagtgttttataactgtagttgtgtggaag 276 

i n 1 1 rim 1 1 1 nil n iiiiiirni in ri 1 1 if 1 1 1 1 1 1 1 mini nnii if 

gcaaat cctcaagtggt at taaaaagcatacagtgt 1 1 tataactgt agt tgtgtggaag . 15.69 
25 Query; .277 taactggtctccagaacagaaattactcagcgcacttgggtgaatgcccaagagataata 336 

i mil i t 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 n j i t m i tn 1 1 i 1 1 1 i i t iTi 1 1 1 1 1 1 1 

taactggtctccagaacagaaattactcagcac^ 162 S 

„ - - . cttgtacaaggaaatttttcatctatgttgcaattcaagtcataaactctttgttctctq 3 96 

30 . 1 1 1 r 1 1 ri 1 1 1 m r 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 ifi r n i in ! i ii i m 1 1 1 1 1 1 r 1 1 1 1 1 

oi" *" cttgtacaaggaaatttttca.tctatgttgcaattcaagtcataaactctttgttctctg 1689 

caacaggaggtacc 410 

milium 1 "" ■ — ■ ^ - - 

35 Sbjct: 

In addition, the AMF5 nucleic acid sequence has high homology to other nucleic acid 
sequences whose BLASTN alignment data is shown in Table 5D. ; 

Table 5D; BLASTN alignment results for AMF5 

40 



Score 


E 


(bits) 


Value 


.654 


0.0 


654 


0.0 


557 


e-156 


557 


e-156 


557 


e-156 


549 


e-154 



Sequences producing significant alignments: 

HSA251506 AJ251506 Homo sapiens mRNA for organic anion trans 

AF137815 AF187815 Homo sapiens liver- specif ic organic anion . . . 
AF205071 AF205071 Homo sapiens organic anion transport polyp... 
AF060S00 AF060500 Homo sapiens liver specific transporter mR. . . 
AB026257 AB026257 Homo sapiens mRNA for organic anion transp . . . 
HSA132573 AJ132573 Homo sapiens mRNA for organic anion trans... 

A BLASTP search was performed against public protein databases. As shown in 
50 Table 5E, the AMF5 protein has 1 19 of 1 36 amino acid residues (87 %) identical to, and 125 
of 136 residues (91 %) positive with, the 691 amino acid residue long Homo sapiens (human), 
liver-specific organic anion transporter (organic anion transport polypeptide 2) (oatp 2) (Acc. 
No.)(SEQIDNO:74). 
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Table 5E. BLASTP of AMFSa against OATP (SEQ ID NO:74) 

>OAT6_HDMAN Q9y616 homo sapiens (human) . liver -specific organic anion transporter 
(organic anion transport polypeptide 2) (oatp 2) . 10/2000 Length =691 
Score = 265 bits (670) , Expect = 9e-71 
Identities = 119/136 (B7%), Positives = 125/136 (91%) 

Query: 1 SLSFYLLYFFILCENKSVAGLRMRYDGNNPVTSHRDVPLEYCNSDCNCDESQWEPVCGNN 60 

I M I I I I II I I I I I I I I I i I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Sbjct: 418 SLSFYLIjYFFILCENKSVAGLTMTYDGNNPVTSHRDVPLSYCNSDCNCDESQWEPyCGNN 4 77 

Query: 61 GITYLSPCLAGGKSSSGIKKHTVFYNCSCVEV^GLQNRNYSAHI^ECPRDNTCTRKFFIY 120 

IIIMIIIIillllll II 1 1 1 Mll-1 1 1! II i INI 1 1 1 ! ill I k Mill* + 

Sbjct: 4 78 GITYISPCIiAGCKSSSGNKKPIVFYNCSCLEVTGLQNRNYSAHLGECPRDDACTRKFYFF 53 7 

15 Query: 121 VAIQVINSLFSATGGT 13 6 

MMM III III 
Sbjct: 538 VAIQVLNLFFSALGGT 553 

20 The amino acid sequence of AMF5 also has high homology to the amino acid 

sequences shown in BLASTP alignment data in Table 5F 

Table 5F. BLASTP alignment results for AMF5 

Score E 7 

Sequences producing significant alignments: (bits) Value 

25 OAT 6_HUMAN Q9y616 homo sapiens (human) . liver-specific organ. . . 265 9e-71 

OAT3_RAT 088397 rattus norvegicus (rat) . sodium- independent ... 108 2e-23 

088397 088397 rattus norvegicus (rat), organic anion transpo. . . 108 2e-23 " 

OATP_HUMAN P46721 homo sapiens (human).- sodium- independent o... 106 9e-23 

OAT2JRAT 035913 rattus norvegicus (rat) . sodium- independent ... 102 le-21 

30 OATP_RAT P46720 rattus norvegicus (rat) . sodium- independent ... 99 8e-21 

OATK_RAT P70502 rattus rioryegicus (rat) sodium- independent ... 98 2e-20 ~ 

P70502 P70502 rattus norvegicus (rat) . oat-kl. 1/1999 98 2e-20 

. The presence of identifiable domains in AMF5, as well as all other AMFX proteins, can 
35 be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website ( http:www.ebi.ac.uk/ interproV 
DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 
Reverse Position Specific BLAST analyses. This BLAST analysis software samples domains 
40 found in the Smart and Pfam collections. 

Expression information for AMFX RNA was derived using tissue sources including, 
but not limited to, proprietary database sources, public EST sources, literature sources, and/or 
RACE sources, as described in the Examples. AMF5 is expressed in at least the following 
tissues: liver, brain, lung, kidney, and testis; additional transcripts were also observed. The 
45 authors stated that the extra-hepatic expression of OATP suggests a general role for OATP in 
trans-epithelial organic anion transport.. 

The nucleic acids and proteins of AMF5 are useful in potential therapeutic applications 
implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 
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encoding the organic anion transporting peptide -like protein may be useful in gene therapy, 
and the organic anion transporting peptide -like protein may be useful when administered to a 
subject in need thereof. The novel nucleic acid encoding AMF5 protein, or fragments thereof, 
may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
5 acid or the protein are to be assessed. These materials are further useful in the generation of 
antibodies that bind irnmunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

The AMFX nucleic acids and proteins are usefid in potential diagnostic and therapeutic 
applications implicated in various diseases and disorders described below and/or other 
1 0 pathologies- For example, the compositions, of the present invention, will have efficacy for 

treatment of patients suffering from: colon adenocarcinomas, small cell lung cancers, ovarian 
cancers, prostate cancers and gliomas , and other diseases,, disorders and conditions of the like. 
By way of noniimiting_example, the compositions of the present invention will have efficacy 
for treatment of patients suffering from colon.adenocarcinomas, small cell lung-cancers, 
1 5 ovarian cancers, prostate cancers and gliomas. Additional AMF-related diseases and disorders 

are mentioned throughout the Specification. 

Further, the protein similarity information, expression pattern, and map location for 
AMFS suggests that AMF5 may have important structural and/or physiological functions 
characteristic of the AMF family. Therefore, the nucleic acids and proteins of the invention are 
20 useful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a s^cific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
25 antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 

composition promoting tissue regeneration in vitio and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
irnmunospecifically to the novel AMFS substances for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
30 prediction from hydrophobicity charts, as described in the "Anti-AMFX Antibodies" section 
below. In various embodiments, contemplated AMFS epitopes are hydrophilic regions of the 
AMFS polypeptide as predicted by software programs well known in the art that generate 
hydrophobicityior hydrophilicity plots. 
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AMF-6 (also referred to as Acc. No. 38905521) 

Novel AMF6 is MEGF protein-related. The AMF6 clone is alternatively referred to 
herein as Acc. No. 3890552L The AMF6 nucleic acid (SEQ ID NO:ll) of 332 nucleotides is 
shown in Table 6A. The AMF6 open reading frame ("ORF") begins at nucleotides 3-5. The 
AMF6 ORF terminates at nucleotides 3 18-320. AMF5 appears to be an internal fragment so it 
is contemplated that the ORF could extend beyond the N- and C- termini. As shown in Table 
6 A, putative untranslated regions 5' to the start codon and 3' to the stop codon are underlined, 

and the start and stop codons are in bold letters. J ' 

i ■ 

Table 6A. AMF6 nucleotide sequence (SEQ ID NO: 11). 

TOGC^GCCOTGGAGGAGCCGATGGTGGACCTGGACGGCGAGCT^CCTTTCGTGCGGCCCCTGCCCC^C^TTGCCGT 
GCTCCAGGACGAGCTGCCGCAACTCTTCCAGGATGACGACGTCGGGGCCGATGAGGAAGAGGCAGAGTTGCGGGGC 
GAACACACGCTCAGAX3AGAAGTTTGT 

GCAGGAACGGAGGGACCTGCCTCCTGGGCCTGGATGGCTGTGAT^ 
TGAGATTTGTCCTCCGGA 

The encoded AMF6 protein (SEQ ID NO:12) is a 106 amino acid protein shown in 
Table 6B. 

Table 6B. AMF6 amino acid sequence (SEQ ID NO: 12) 

AALEEPIWDLDGELPFVRPLPHIAVLQDELPQLFQDDDVGADEEEAEIiRGEHTLTEKFVCLDDSFGHDCSLTCDDC 
20 RNGGTCLLGLDGCDCPEGWTGVICNEICPP X ■ 

In an analysis of public nucleic acid sequence databases, it was found, for example, that 
the AMF6 nucleic acid sequence has one fragment 1 54 of 1 79 bases (86%) identical and a 
second fragment 79 of 91 bases (86%) identical to Rattus norvegicus mRNA for MEGF6, 
25 complete cds (GenBank Acc. No. AB01 1 532) (SEQ ID NOs:75 and 76) shown in Table 6C. 

Table 6C. BLASTN of AMF6 against MEGF6 mRNA (SEQ ID NO:75 and 76) 

>AB011532 AB011532 Rattus norvegicus mRNA for MEGF 6 , complete cds. 8/1958 
Length = 5523 
Score = 157 bits (79), Expect = 4e-36 
30 Identities = 154/179 (86%) 

Sbjct: residues 1738 to 1916 (SEQ ID NO:75); Strand = Plus / Plus 



35 



40 



Query: 141 gagttgcggggcgaacacacgctcacagagaagtttgtctgcctggatgactcctttggc 20 0 

llllllll II MM I Mill Ml! Illllill II Mill Mill lllllil II 

Sbjct; 173 B gagttgcgtggagaacacacgctcactgagaagtttgtctgcttggatcactccttcggg 17 97 
Query: 201 catgactgcagcttgacctgtgatgactgcaggaacggagggacctgcctcctgggcctg 260 

llllllllllll I Mill 1 1 III I MM II 1 1 II Mill III III Mill I 

Sbjct: 1793 catgactgcagcctaacctgcgatgactgcaggaatggggggacttgcttcccgggccag 1857 

Query: 261 gatggctgtgattgccccgaggggtggactggggttatttgcaatgagatttgtcctcc 319 

I I I I I I I I I I I I I I M I I I I J I I I I I II I I II I I I I I I I I I I I I I I I I I I 
Sbjct: 1858 gacggctgtgactgcccagagggctggactggaatcatctgcaatgagacttgtcctcc 1916 



45 Score - 85.7 bits (43), Expect » le-14 
Identities = 79/91 (86%) 

Sbjct: residues 1616 to 1706 (SEQ ID NO:76); Strand = Plus / Plus 
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Query: 22 tggtggacctggacggcgagctgcctttcgtgcggcccctgccccacattgccgtgctcc 81 

MIIMIIIIIM |j| 1 1 1 1 1 1 || lllll IIIIIIIIIIIIIIMM Mm 

Sb^ct: 1616 tggtggacctggatggccggctgccctttgtgcggcccctgccccacattgcggtgctga 1675 
Query: 82 aggacgagctgccgcaactcttccaggatga 112 

Ml MINIM I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Sb3ct: 1676 gggatgagctgccccgactcttccaggatga 1706 

A BLASTP search was performed against public protein databases. As shown in Table 
6D, the AMF6 protein has 89 of 107 amino acid residues (83%) identical to, and 95 of 107 
residues (88 %) positive with, the 1574 amino acid residue long Rathis norvegicus (rat). megfS 
(Acc. No. 088281) (SEQ ID NO:77). J 3 / U : - : 

Table 6D. BLASTP of AMF6a against MEGF6 (SEQ ID NO:77) 

>OS8281 088281 rafctus aorvegicua • (ra t ) v ' megf 5 . 5/1999 Length = 1574 
Score a 194 bits (489) , Expect » le-49 

Identities.^ 89/107 ( 83%.),, ..Positives » 95/107. (89%).,.. .Gaps » 3/107 (2%) 
Query: 2 ALEEPMVPLDGELPFVRPLPHIAVLQDELPQLFQDDDVGADEEEA- - ELRGEHTLTEKFV 59 

-■. + 111 htiii itiiifiiiiiri*riii + iHn rr+iiu timiiiiiihf - 

SbjCt : 45S SLEESWDLTCRLPPyRPL^ 5 14 
Query:. 60 CLDDSFGHDCSLTCDDC^GGTCLI/3IiDGCDCPEGWTGVICNEICPP 106 v 

iii MiiiiiiHHiMiiii r.iiiiimiii+iirF'iii • <■ ■ ■■■■ 

Sbjct: 515 CLDHS FGHDCSLTCDDCRNGGTCFPGQDGCDCPEGWTG 1 1 CNETCPP 561'. 

The presence of identifiable domains in AMF6, as well as all other AMFX proteins, can 
be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining tfie Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi ac.uk/ interproV 
DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 
Reverse Position Specific BLAST analyses: This BLAST analysis software samples domains 
found in the Smart and Pfam collections. _ . } 

Expression information for AMFX RNA was derived using tissue sources including, 
but not limited to, proprietary database sources, public EST sources, literature sources, and/or 
RACE sources, as described in the Examples. AMF6 is expressed in several regions of rat 
brain. 

The nucleic acids and proteins of AMF6 are useful in potential therapeutic applications 

implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 

encoding the MEGF-like protein may be useful in gene therapy, and the MEGF-like protein 

may be useful when administered to a subject in need thereof. The novel nucleic acid 

encoding AMF6 protein, or fragments thereof, may further be useful in diagnostic applications, 

wherein the presence or amount of the nucleic acid or the protein are to be assessed. These 

materials are further Useful in the generation of antibodies that bind immunospecifically to the 

novel substances of the invention for use in therapeutic or diagnostic methods. 
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The AMFX nucleic acids and proteins are useful in potential diagnostic and therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: gastric and renal cell carcinoma, breast and ovarian 
cancer, and other diseases, disorders and conditions of the like. By way of nonlimiting 
example, the compositions of the present invention will have efficacy for treatment of patients 
suffering from gastric and renal cell carcinoma, breast and ovarian cancer . Additional AMF- 
related diseases and disorders are mentioned throughout the Specification. 

Further, the protein similarity information, expression pattern, and map location for 
AMF6 suggests that AMF6 may have important structural and/or physiological functions 
characteristic of the AMF family. Therefore, the nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel AMF6 substances for use in therapeutic or diagnostic - 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-AMFX Antibodies" section 
below. In various embodiments, contemplated AMF6 epitopes are hydrophilic regions of the 
AMF6 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 

AMF-7 (also referred to as Acc. No. 4194093) 

Novel AMF7 is an Interleukin-1 1-like ("IL-1 1") protein. The AMF7 clone is 
alternatively referred to herein as Acc. No. 4194093. The AMF7 nucleic acid (SEQ ID NO:13) 
of 1332 nucleotides is shown in Table 7A. The AMF7 open reading frame ("ORF") begins at 
nucleotides 2-4. The AMF7 ORF terminates at a TGA codon at nucleotides 1307-1309. 
AMF7 appears to be a C-terminal fragment, so it is contemplated that the ORF extends beyond 
the N-terminus. As shown in Table 7 A, putative untranslated regions 5' to the start codon and 
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TTCGACACAGACCAOTrGLATT^ 



GGAGCXTCCTGCAGGCCATGGTGGG^ 
CCAAGACCATGTCCTGTOTGGAGGCCCC^ 



3' to the stop codon are underlined, and the first coding triplet and the stop codon are in bold 
letters. 

Table 7A. AMF7 nucleotide sequence (SEQ ID NO:13). 

CGCCTTCATGCTGCCGGCGGGCTGCTCGCGCCGGCTGGTGGCCGAGCTGCAGGGCGCCCTGGACGCCTGCGCACAG 
:> CX^C^^TTGGAATTGGAGGAGAGCCTGCXii^ 
CTTTGAAGCCACCTCCAGGGCC^GAAACT^ 
CAAAGAGTTGGAGTTTCTGACCCAGGCACT 

AGAGACAAGGCCCCCAGCCTGAAATCTAGGTCCATTGTCACCTCTTCJTGGC71CGACAGCCTCCGCCCCACCGCATT 
CCCCAGGCCAAGCTGGTGGCCATGCTTCAGACACG^^ 
10 GGGCC^CCCTGAGCGCCGGCTGCTGTC^GTGG^ 

GGGGCGGGGCTCAC^GACCAGCAAATGGCCCCATCCGG^ 



15 



CCAGC^GATCCACTTGGAAAAGGTCCTGAra 
20 CCCTGGCTGGCCCTGTGCCGGGCTGTGCACAGC^ 

ATGAACCTGCAGTCTG AGCCTTTCCCATGCTGCCCTCGGC; 

The encoded AMF7 protein (SEQ ID NO: 14) is a 435 amiiib acid protein shown in 

Table 7B. : -' ' : : • ' " ' - ' ■ - : - ' : 

25 Table 7B. AMF7 amino acid sequence (SEQ ID NO:14) 

AFMIjPAGCSRRIA^LQGAIJ)AC^R^^ 

KELEFLTQALEKAVRVRRG I TKAEERDKAP S LKS RS I VTS SGTTAS APPHS PGQAGGHASDTRPTKGLRQTTVP AK 
GHPERRLLS VGDGTRVGMGARTPRPGAGIiRDQQMAPSAAPQ^ QNS SLjWAQLS 

STQTSDSTDAAAAKTQFLQNMQTASGGPQP . 
30 EGLQAMVGQCIJIRLQEIJiAAVAEQPPRPCPVGRPPGASPSCGGR^ 
QQIHLEKVI>lAEliPI*VSAAQPQGPPWI»Ai»CRAVHS 

In ah analysis of public nucleic acid sequence databases, it was found, for example, that 
a fragment of the : AMF7 nucleic acid sequence has 1299 of 1300 bases (99%) identical to a 
35 Homo sapiens cDNAFLJ13909 fis, clone Y79AA 1000065 (GenBank Acc. No. AK023971) 
(SEQ ID NO:78) shown in Table 7C. 

Table 7C. BLASTN of AMF7 against cDNA FLJ13909 (SEQ ID NO:78) 

>AK023971 AK023971 Homo sapiens cDNA FLJ13909 fis, clone Y79AA1000065 . 
9/2000 Length = 1708 Strand = Plus /.Plus 
40 Score = 2569 bits (1296), Expect « 0.0 
Identities = 1299/1300 (99*) 

Query: 33 ggctggtggccgagctgcagggcgccctggacgcctgcgcacagcgacaattgcaattgg 92 

M I I I I I I I I I I I I I I I II I I I II I I I I I I | | | | | I | | M I 1 I i I I I I I I I I I I 1 I I I I I 
Sbjct: 138 ggctggtggccgagctgcagggcgccctggacgcctgcgcacagcgacaattgcaattgg 197 



50 



Query: 93 agcagagcctgcgcgtttgccgtcggctgctgcatgcctgggaaccaactgggacccggg 152 

I I I I M M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 
Sbjct : 198 agcagagcctgcgcgtttgccgtcggctgctgcatgcctgggaaccaactgggacccggg 257 
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Query: 


153 




Sb j ct : 


258 


5 


Query: 


213 




Sbjct : 


318 


10 


Query: 


273 


Sb j ct : 


378 




Query: 


333 


15 


Sbjct: 


438 




Query: 


393 




Sbjct: 


498 




Query: 


453 




Sb j ct : 


558 


25 


Query: 


513 




Sbjct: 


618 




Query: 


573 




Sbjct: 


678 




Query: 


633 


35 


Sb j ct : 


738 




Query : 


693 


Art 


Sbjct: 


798 




Query : 


753 




Sb j ct : 


858 


45 


Query: 


813 




Sbjct: 


918 


JV 


Query: 


873 




Sb j ct : 


978 




Query : 


933 • 


55 


Sb j ct : 


1038 






993 


60 


Sbjct: 


1098 




Query: 


1053 




Sbjct: 


1158 


65 


Query: 


1113 




Sb j ct : 


1218 



ctttgaagccacctccagggccagaaactaatggagaggacccccttccagcatgcacac 212 

III IIMM lil IMIMII III II II III 1,11 IIIIMIIMI lllllll IIMIII I 

ctttgaagccacctccagggccagaaactaatggagaggacccccttccagcatgcacac 317 
ccagtccacaagacct caaagagt tggagt ttctgacccaggcactggagaaggctgtac 2 72 

IIIIIIIIMIIIMIIIMIMII IIIIIIIIIIMKIIIIIIIIIIIII IIIIIMI 

ccagtccacaagacctcaaagagt tggagt ttctgacccaggcactggagaaggctgtac 3 77 
gagttcgaagaggcatcactaaggccgaagagagagacaaggcccccagcctgaaatcta 33 2 

III IIMM I II II llll II I II 1 1 IMI! MM I II II I! 111! I III II 1 1 IIMM 

gagttcgaagaggcatcactaaggccggagagagagacaaggcccccagcctgaaatcta 43 7 
ggtccattgtcacctcttctggcacgacagcctccgccccaccgcattccccaggccaag 392 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 ! I M 

ggtccattgtcacctcttctggcacgacagcctccgccccaccgcattccccaggccaag 497 
ctggtggccatgcttcagacacgagacccaccaagggcctccgccagaccacggtgcctg 452 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 i II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

ctggtggccatgcttcagacacgagacccaccaagggcctccgccagaccacggtgcctg 557 
ccaagggccaccctgagcgccggctgctgtcagtgggggatgggacccgtgttgggatgg 512 

1 1 f 1 1 1 1 1 1 1 1 lllllll II 1 1 1 1, 1 1 II MINIMI 1 1 IN II II I llll 1 1 1 MM I 

ccaagggccaccctgagcgccggctgctgt cagtgggggatgggacccgtgt tgggatgg 617 
gagcccgaacccccaggcctggggcgggcctcagggaccagcaaatggccccatccgctg 572 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f r 1 1 1 1 1 j 1 1 1 1 1 1 1 j f 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 e 1 1 1 1 1 1 1 1 1 

gagcccgaacccccaggcctggggcgggcctcagggaccagcaaatggccccatccgctg 677 
ctcctcaggccccagaagccttcacactcaaggagaaggggcacctgctgcggctgcctg 63 2 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ctcctcaggccccagaagccttcacactcaaggagaaggggcacctgctgcggctgcctg 737 
cggcattcaggaaagcagcttcccagaactcgagcctgtgggcccagctcagttccacac 692 

1 1 f 1 1 f 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 j r 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 

cggcattcaggaaagcagcttcccagaactcgagcctgtgggcccagctcagttccacac 79 7 
agaccagtgattccacggatgccgccgctgccaaaacccagttcctccagaacatgcaga 752 

II 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 III 1 1 1 II I II 1 1 1 II I II 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 

agaccagtgattccacggatgccgccgctgccaaaacccagttcctccagaacatgcaga 857 
cagcttcaggcgggccccagcccaggctcagtgctgtggaggtggaggcggaggcggggc 812 

II lllllll MIIMIII II Mill 1 1 MINIMI Nil imitMIIIII MIMII 

cagcttcaggcgggccccagcccaggctcagtgctgtggaggtggaggcggaggcggggc 917 
gcctgcggaaggcctgctcgctgctgagactgcgcatgagggaggagctctcagcagccc 872 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gcctgcggaaggcctgctcgctgctgagactgcgcatgagggaggagctctcagcagccc 977 
ccatggactggatgcaggagtaccgctgcctgctcacgctggaggggctgcaggccatgg 932 

III I II! II 1 1 II lllllll 1 1 II II I Mil Mill III I III I II Ml II 1 1 IIMM I 

ccatggactggatgcaggagtaccgctgcctgctcacgctggaggggctgcaggccatgg 103 7 
tgggccagtgtctgcacaggctgcaggagctgcgtgcagcggtggcggaacagccaccaa 992 

III III I II II II llll III II II 1 1 IIMM lllllll! II II II I IIMM Mill M 

tgggccagtgt c tgcacaggc tgcaggagctgcgt gcagcggtggcggaacagccaccaa 1097 
gaccatgtcctgtggggaggccccccggagcctcgccgtcctgtgggggtagagcggagc 1052 

MM lllllll llllllllll III 1 1 1 1 1 1 f I } 1 1 1 f 1 1 1 1 1 1 1 1 1 MIMII MIMII 

gaccatgtcctgtggggaggccccccggagcctcgccgtcctgtgggggtagagcggagc .1157 
ctgcatggagcccccagctgcttgtctactccagcacccaggagctgcagaccctggcgg 1112 

III I Mill II IIIIIHIIMM MIMII lllllim IIMM IIIIMI il lllll 

ctgcatggagcccccagctgcttgtctactccagcacccaggagctgcagaccctggcgg 1217 
ccctcaagctgcgagtggctgtgctggaccagcagatccacttggaaaaggtcctgatgg 1172 

IIIIIIIMIIIMIIIIIIIMIII IMIIIMIIMMIIII IMIIIIIIMIIIII 

ccctcaagctgcgagtggctgtgctggaccagcagatccacttggaaaaggtcctgatgg 127 7 
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Query: 1173 ctgaactcctccccctggtaagcgctgcacagccgcaggggccgccccggctggccccgt 1232 

Ml IMMIIIIII lllilllMIIIMIIIMIIIIIIIMIIIIIIIIIIIIIIilll 

Sbjct: 1278 ctgaa-ctcctccccctggtaagcgctgcacagccgcaggggccgccctggctggccctgt 133 7 
Query: 1233 gccgggctgtgcacagcctgctctgcgagggaggagcacgtgtccttaccatcctgcggg 1292 

I i f 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 i f 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 ) 1 1 1 j I i 1 1 f 1 1 1 1 1 i 1 1 1 1 

Sbjct: 133B gccgggctgtgcacagcctgctctgcgagggaggagcacgtgtccttaccatcctgcggg 1397 
Query: 12 93 atgaacctgcagtctgagcctttcccatgctgccctcggc 1332 

. IIMIIIIIIIIMIIIIIIIIIIIIIIII IIMIMIII 

Sbjct: 1398 atgaacctgcagtctgagcctttcccatgctgccctcggc 1437 

A BLASTP search was performed against public protein databases. As shown in Table 
7D, the AMF7 protein has 78 of 332 amino acid residues (23 %) identical to, and 1 13 of 332 
residues (34 %) positive with, the 1 1 51 amino acid residue long Gallus gallns (chicken), high 
molecular mass nuclear antigen (fragment) (Acc. No. 057580) (SEQ ID. NQ;79). 

Table 7D. BLASTP of AMF7 against chicken HMMNA (SEQ ID NO:79) 

O57580 gallus gallus {chicken) . high molecular mass nuclear- antigen (fragment) . 
11/1998 Length = 1151 

Score ~ 43 . 8,. bits M 101.. 0) Expect - 0 . 002 

Identities .- . 78/332 (23%) , Positives ~- 113/332 , . (34%) , 

Query: 44 WEPTGTRALKPPPGPE-TNGEDPLPACTPSPQDLKELEFLTQALEKAVRVRRGITKAEER 102 

I I I -till - I \ I + |.+ - II I . .V. * | I 

Sbjct: 52 WVPIG- -GAPPPPGTEPTPPSKPTDGADAAPKASAELTSPPPASPSPPDGPKAPSGAGEA 109 

Query: 103 , D KAP S LKS RS I VT S S GTTAS AE P HS PG Q AGGS VGDQTR VGMG ART PRPGAGLRDQQM 159 

1+ I M I I II I I I I I ++ +11 

Sbjct: 110 E^TPPPSQG-------PAGTPPPSQGAAG^K^ 162 

Query: ISO AHASDTR PTKGLRQTTVPAKGHP ERRLLP S AAPQAPEAFTLKE KGHLLRLPAAFRKAAS Q 219 

l ++ II III f I + + 1 + H + l I II Nil 

Sbjct: 163 TAATEARPASA- ASPTVP- KATAEATAVTAASQSAPKAAT- - .-- -DAAAVTAASQ 210 

Query: 220 NS-SLWAQLSSTQTSDSTDAAAAKTQFLQNMQTASGGPQPRLS 261 

++ ++..+++11 -Mill 

Sbjct: 211 S APKATVE VKP AAAAVAKEAKAVTAAAAAP KATAKAKPAPVTS PTI PCSSAEAKPLTAAS 270 

Query: 262 - -AVEVEAEAGRLRKACSLLRLRMRJ3ELSAAPMDW 319 

I + III + ||+ ++ I II + + + + I + 

Sbjct: 271. - P TAS KATAEAKP VP ATAS LMAT KVTABAJCP AP S P S VP - - KATTD T KAVTATAP KAG P D VK 328 

Query: 3 20 . ■ ELRAAVAEQPPRPCPVGRPPGASPSCGGRAEP 351 

I II I I I II I II 

Sbj.Ct :.. 3 29 PAVAVCAEAKPAPPP PPQQLPKAAAAAAP 357 



50 



AMF7 also is 1 6% identical to and 2 1 % positive with Interleukin- 1 1 Precursor (IL- 1 1 ) 
(Adipogenesis InhibitorY Factor) (AGIF) (GenBank Acc. No. P20809) (SEQ ID NO:80) 
shown in Table 7E. 



55 



Table 7E. BLASTP of AMF7 against EL- 11 Precursor (SEQ ID NO:80) 



Identities: 0 . 16; Similarities : 0.21; Similarity Matrix: BLOSUM62 

10 20 30 40 50 

iii'A I ;* * * I I I I 1^* * I I — • I ; • • -I 

AMF7 AFl^PAcSs wOTSAELQGALiDAGAQRQIjQLEQS IiRVCRRlSlAfepT^RSLKP P£ 

Consensus M laffl 




60 

•I 
3PP 

Jet 



35 
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hIL-llpre 
AMF7 

Consensus 



hIL-llpre 
10 AMF7 

Consensus 



15 



20 



25 



30 



hIL-llpre 
AMF7 

Consensus 



hIL-llpre 
AMF7 

Consensus 



hIL-llpre 
AMF7 

Consensus 



hIL-llpre 
AMF7 
35 Consensus 



nIL-llpre 
40 AMF7 

Consensus 



70 80 90 100 110 

B. g p _jOT Sp _ _ _ _ra„ _ra ra_ 

aEDPLPACTPSgODLKELEFLTQALEKaV^RRGITKAEERfflKASsiiKS 



120 
••I 



3IVTSSGTT 



130 



140 



I 



150 



160 



170 



180 



LD--STV-L L H 

AP PHgPGQAGGS VGDGgRVGMG, 



I 



190 



BHNLDS 
HPfeRRL 





I 



220 



I 



230 



240 



G--V-IJ- 

RKAASQNSSLWAQBS: 



... |- | 

-HRL 

sHqtsdstdaaa 



250 260 270 280 290 300 

....|.....|....|....|.,..|....|....|....|....|....|....|....| 

-RA DliLS'YL gH gQ W-HgRgGdSSLKT,LEPgJq 

AKmFLQtftQTASGGPQPBliSAVEBEAEAGR^^ 



360 




420 



£ G-£R^~ -U jJjlLGG'j 

SSTQEIiQTLAAIiKLRVAVLDQQIHLEKVLM^ 




430 
|....|....| 
WAVSG[JLL|1KTR- -£ 
1RX>EPAV 




The presence of identifiable domains in AMF7, as well as all other AMFX proteins, can 
be determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 

45 Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website fhttp:www.ebi.ac.uk/ interpro) . 
DOMAIN results can then be collected from the Conserved Domain Database (CDD) with 
Reverse Position Specific BLAST analyses. This BLAST analysis software samples domains 
found in the Smart and Pfam collections. 

50 Expression information for AMFX RNA was derived using tissue sources including, 

but not limited to, proprietary database sources, public EST sources, literature sources, and/or 
RACE sources, as described in the Examples. AMF7 is expressed in at least the following 
tissues: colon, ovarian, lung, renal and breast cancer. The expression in lung and renal cancer 
cell lines correlates with expression in the fetal tissues, indicating a oncofetal phenotype.. 
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The nucleic acids and proteins of AMF7 are useful in potential therapeutic applications 
implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 
encoding the IL-1 1-like protein may be useful in gene therapy, and the IL-1 Mike protein may 
be useful when administered to a subject in need thereof. The novel nucleic acid encoding 
5 AMF7 protein, or fragments thereof, may. further be useful in diagnostic applications, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed. These materials : 
are-further useful in the generation of antibodies that bind immunospecifically to the novel 
substances of theinvention for. use in: therapeutic or diagnostic methods; 

The AMFX nucleic acids and proteins are useful in potential diagnostic andtherapeutic 

10- applications implicated in various diseases and disorders described below and/or other 

pathologies. For example^ the compositions o£the present invention will have efficacy for 
treatment of patients suffering from: diseases involving the growth of hematopoietic progenitor 
cells and platelet maturation, lung and renal cancer- and otiier diseases, disorders and j 
conditions of the like.: By : way of nohlimiting example, the compositions of the present 

1 S invention will have efficacy for treatment of patients suffering from diseases involving the 
growth of hematopoietic FrogenitorcellsLand plateletmaturation y lung and renal cancer . 
Additional AMF-related diseases and disorders are mentioned throughout the Specification;. 

Further, the protein similarity informatibn, expression pattern, and map location for 
AMF7 suggests that AMF7 may have important structural and/or physiological functions 

20 characteristic of the AMF family. Therefore, the nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 

25/ molecule drug target* (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel AME7 substances for use in therapeutic or diagnostic 

30 methods.. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Antir AMFX Antibodies" section 
below. In various embodiments, contemplated AMF7 epitopes are hydrophilic regions of the 
AMF7 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 
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AMF-8 (also referred to as Acc. No, AC01136_A) 

AMF1 is a novel pleitrophin-like polypeptide. The AMF1 clone is alternatively 
referred to herein Acc. No. AC01 136_A. The AMF1 nucleic acid (SEQ DDNO:15) of 510 
nucleotides is shown in Table 8 A. The AMF1 open reading frame ("ORF") (SEQ ID NO: 16) 
begins at nucleotide 1 . The AMF 1 ORF terminates at a TAA codon at nucleotides 510-513. 
The AMF 1 protein was predict to be a secreted protein. 

Table 8A: AMF-8 DNA (SEQ ID NO:15) AND POLYPEPTIDE (SEQ ID NO:16) 

Translated Protein - Frame: 1 -Nucleotide 1 to 510 

ATG CAGGC TC AACAGT ACCAG CAG CAG CGT CGfiA?^TTTGCAGCTGCCl"lV"l"lXiG CA.TTCfi.TI TT C ATACTGG CAG CTGT 8 0 

MQAQQYQQQRRKFAAAFLAFIFILAAV 

G GAT ACTGCTG AAG CAGGGAAG AAAGAG AAACCAG AAAAAAAAGTGAAGAAGTCTGACTG TGG AG AAT G GCAG TGG AGTG 1€ 0 

D TAEAGKKBKPEKKVKKSDCGE WQWSV 
TGTGTGTGCCCACCAGTGGAGACTGTGGGCTGGGCACACGGGAGGGCACTCGGACTGGAGCTGAGTGCAAGO 24 0 

CVPTSGDCGIiGTREGTRTGA'ECKQTK 
AAGACCCAGAGATGTAAGATCCCCT^CAACTGGAAGAAGCAATTTGGCGCGG 320 
KTQRCKI P CNWKKQFGAECKY QF QAWG 
AGAATGTGACCTGAACACAGCCCTGAAGACCAGA^ 4 00 

ECDLNTALKTRTGSIiKRALHNAECQKT 
CTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCC^ 4 80 

VTI S K PCGKL TKPKPQGTLELKVKKK 
AAAAAAAAAAAAAATT CTGAGGAGAC CTTTTAG " 513 
KKKKNSEETF 

BLASTN information for AMF8-related nucleic acid sequences is shown in Table 8B. 
Table 8B BLASTN analysis results for AMF8 

Score E 

Sequences producing significant alignments: (bits) Value ~ ; 

HUMHBNF1 M57399 Human nerve growth factor (HBNF-1) mRNA, com... 894 0.0 
HSHBGF8 X52946 Human pie iotrophin (PTN) mRNA. 9/1993 894 0.0 

AB004306 AB004306 Homo sapiens mRNA for osteoblast stimulati... 894 0.0 
D89546 D89546 Porcine mRNA for pleiotrophic factor beta, com. . . 618 e-175 
BTHBGF8 XS2945 Bovine pleiotrophin (PTN) mRNA. 9/1993 609 e-172 

RATHBGAM M55601 R.norvegicus heparin-binding growth associat . . . 531 e-148 
MUSOSF1 D90225 Mouse mRNA for OSF-1. 6/1999 502 e-139 

In an analysis of public nucleic acid sequence databases, it was found, for example, that 
the AMF1 nucleic acid sequence has 541/541 bases (100%) identical to human nerve growth 
factor (GenBank Acc. No. M57399) (SEQ ID NO:81) shown in Table 8C. In all BLAST 
alignments herein, the "E- value" or "Expect" value is a numeric indication of the probability 
that the aligned sequences could have achieved their similarity to the BLAST query sequence 
by chance alone, within the database that was searched. For example, as shown in Table 8B, 
the probability that the subject ("Sbjct") retrieved from the AMF1 BLAST analysis, in this 
case the human nerve growth factor gene, matched the Query AMF1 sequence purely by 
chance is zero as shown by the E value of 0.0. 
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Table 8C. BLASTN of AMF1 against human NGF (SEQ IB NO:81) 

>HUMHBNF1 M57399 Human nerve growth factor (HBNF-1) mRNA, complete cds. 4/1993 
Length = 1029/ Strand * Plus / Plus 

Score = 894 bits (451), Expect =0.0 

Identities = 451/451 (100%) 

atgcaggctcaacagtaccagcagcagcgtcgaaaatttgcagctgccttcttqgcattc 60 

1 1 lllllllllllllllllllllllll.l Mill lllllllllll.il 1 1 III II II Mill 

atgcaggctcaacagtaccagcagcagcgtcgaaaatttgcagctgccttcttggcattc 4 55 
attttcatactggcagctgtggatactgctgaagcagggaagaaagagaaaccagaaaaa 120 

I MINI IIMIMMIIIIIIIIMIIIIIII.il Mill I III II 1 1 III II II Mill 



Query: 


1 


Sb j ct : 


396 


Query: 


61 


Sbjct : 


456 


Query: 


121 


Sbjct : 


516 


Query: 


181 


Sb^ct: 


576- 


Query: 


241 


Sbjct: 


636- 


Query: 


3 01 


Sbjct: 


6*6 


Query: 


3 61 


Sbjct:; 


756. 


Query: 


421 


Sbjct: 


816 



^aagugaagaagcctgactgcggagaatggcagtggagtgtgtgtgtgcccaccagtgga 180 

1 1 M 1 1 1 II I II II II M I J I M : l II M M.I.I I J I.J 1 1 1 1 II 1 1 1 1 II I M || 1 1 II 1 1 1 

aaagtgaagaagtctgactgtggagaatggcagtggagtgtgtgtgtgcccaccagtgga 575 

gactg^gggctgggcacacgggagggcactcggactggagctgagtgcaagcaaaccatg 24 0 

I U I I I M II I I I I I I I I I II I I I I II I: I.I M I I I I II II I I I II I II II II I I | | | || | 

gactgtgggctgggcacacgggagggcactc^actg^agctgagtgcaagcaaaccatg 63 5 

aaga'cccagagatgtaagatcccctgcaactggaagaagcaatttggcW 300 

I I LU 1 1 1 1 1 1 1 1 M I M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 J I M M J I Ml j I III 1 1 1 M I i 1 1 1 1 



~J ~J _J J — J"— — — J w» WW w »3MX<]MVV«MaBV \r 3 3 C* 

MINI I II Mill MM I II II III I Mllllltllll.il llllllllllll 



360 

755 



I I IN I III II II Ml Hill II II III Mill M Ml MM II MMII II III IMM 

:agtctgaagcgagccctgcacaatgccgaatgccagaagactgtcaccatctccaagccc..815 - 
tgtggcaaactgaccaagcccaaacctcaag 451. !\; _M . 

1 1 1 MINIM MIMIlll I MMIM IN 

tgtggcaaactgaccaagcccaaacctcaag .84.6-- ,r . 

A BLASTP search was performed against public protein databases. The results from 
this comparison are shown in Table 8D. 



Score 


E 


(bits) 


Value 


92 


2e-18 


92 


2e-18 


79 


le-14 


71 


5e-12 


70 


8e-12 



Table 8U> BLASTP amalysis results for AJMIF8 

Sequences producing significant alignments: 

FGFJ_HUMAN 095750 homo sapiens (human) . fibroblast "growth fa. . . 
O9S750 O95750 homo sapiens (human), fgf-19. 5/1999 
FGFF_MOUSE 035622 "mus musculus (mouse), fibroblast growth fa... 
FGF3_MOTJSE P05524 mus musculus (mouse), int-2 proto- oncogene .. . 
FGF3_HDMAN" PI 14 87 homo sapiens (human), int-2 proto - oncogene. . . 

For example, as shown in Table 8E, the AMF8 protein has 57 of 143 amino acid 
residues (39 %) identical to, and 79 of 143 residues (54 %) positive with, the 216 amino acid 
residue long human fibroblast growth factor. (Acc. No. 095750) (SEQ ID NO:82). 

. Table 8E. BLASTP of AMF1 against human FGF (SEQ IB NO:82) 

>FGFJ_HUMAN 095750 homo sapiens (human) . fibroblast growth factor-19 precursor (fgf- 
19) . 10/2000 Length = 216 

Score =92.1 bits (225), Expect = 2e-18 

Identities 57/143 (39%) , .Positives. = 79/143 (54%), Gaps ^ 6/143 (4%) 
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Query: 15 VSVLAGLLLGACQAHPIP- -DSSPLLQFG- -GQVRQRYLYTDDAQQ-T3AHLEIREDGTV 69 

I + l 1 1 1 I I 1+ 1+ I + +| +11 + 111 + I II II I 

Sbjct: 10 WJI LAGLWL - AVAGRPLAFSDAGPHVHYGWGDP IRLRELYTSGPHGLSSCFLR I RADGW 68 

5 Query: 7 0 GGAADQSPESLLQLKALKPGVIQILGVXTSRFLCQRPDGALYGSLHFDPEACSFRELLLE 12 9 
I M III++II+ +111 + l + H II + M+ 11 + 11 + 
Sbjct: 69 DCARGQSAHSLLEIKAVALRTVAIKGVHSVRYLCMGADGKT^IQGLLQYSEEDCAFEESIRP 12 8 

Query: 13 0 DGYNVYQSEAHGLPLHLPGLQRR 152 

10 MM 11+11 I 11+ I ++I 

Sbjct: 12 9 D G YNVYR S E KHRLP VS LS S AKQR 151 

Expression information for AMFX RNA was derived using tissue sources including, 
but not limited to, proprietary database sources, public EST sources, literature sources, and/or 

1 5 RACE sources, as described in the Examples. AMFf is expressed in at least the following 

tissues, several brain tumor cell lines and fetal derived tissue. The nucleic acids and proteins 
of AMF1 are useful in potential therapeutic applications implicated in various AMF-related 
pathologies and/or disorders. For example, a cDNA encoding the pleiotrophin-like protein 
may be useful in gene therapy, and the pleiotrophin-like protein may be useful when ^ 

20 administered to a subject in need thereof. The novel nucleic acid encoding AMF1 protein, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. These materials are further useful 
in the generation of antibodies that bind immunospecifically to the novel substances of the 
invention for use in therapeutic or diagnostic methods. 

25 The AMFX nucleic acids and proteins are useful in potential diagnostic and therapeutic 

applications implicated in various diseases and disorders described below and/or other 
pathologies. For example, the compositions of the present invention \yill have efficacy for 
treatment of patients suffering from cancer and other cell proliferative disorders. By way of 
nonlimiting example, the compositions of the present invention will have efficacy for treatment 

30 of patients suffering from cancer and other cell proliferative disorders. Additional AMF- 
related diseases and disorders are mentioned throughout the Specification. 

Further, the protein similarity information, expression pattern, and map location for 
AMF1 suggests that AMF1 may have important structural and/or physiological functions 
characteristic of the AMF family. Therefore, the nucleic acids and proteins of the invention are 

35 useful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
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antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecificaily to the novel AMF1 substances for use in therapeutic or diagnostic 
5 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Ariti-AMFX Antibodies" section 
below. In various embodiments, contemplated AMF1 epitopes are hydrophilic regions of the 
AMF1 polypeptide as predicted by software programs Stfell known in the art that generate 
hydrophobicity or hydrbphittcity plots. - — - ; . 

10 AMF-9 (also referred to as Acc. No. AL307658) 

AMF9 is a novel GPCR-like polypeptide. The AMF9 clone is alternatively referred to 
herein Acc. No, AL307658, The AMF9 nucleic acid (SEQ ID NO: 17) is shown in Table 9A^ 
The AMF9- open reading frame ("ORF' ? ) (SEQ ID NO: 18) encodes foia 94 amino acid protein. 
The AME9 polypeptide is encoded in a negative reading frame^The sequence shown below 
1 5 has been reverse-complemented and renumbered to allow reading of the pratein in the 
expected N to C direction. ■_ . . : : : - : :W. ^. % . : : .. 

Table 9A: AMF-9 DNA (SEQ ID NO: 17) and Polypeptide (SEQ ID NO:18) 

Translated Protein - Frame: -1- -■ Nucleotide 16 tb'297 ^ 

20 CGAAGGGCTTTCACAATGCTA!C^TGTGGTCTGGCT(^ 80 

. MLGVVWL. VAVIVQ.SPMWHVQQL 

TGAGATCaUU^TATGACTTGCTATATGAAAAGGAACACATCTGC^^ 160 - 

EI KYDFLYEKEHICCLEEWTSPVH'QKI 
TCTACACCACCTTCATCCTTQTCATCCTC^ 240 

^ Y T T F I L V I L F L L P L ME E E T S C — H Y ' D G D 

AGTGGTGGCTCTCTTTGCTOT GfTGOTG^ . 320 

S G G S L C C V £ g J F""T § C C P Y D D 

AATATGATGATGTCACAATCAAC^TGATTT^ 400 
3 0 GTCT ATG CATTTATGAATGAAAACTTCAAAAA 432 

A BLASTN analysis produced no significant homologies, as shown in Table 9B below. 
In all BLAST alignments herein, ther"E-value' * or "Expect" value is a numeric indication of the 
probability that the aligned sequences could have achieved their similarity to the BLAST query 
35 sequence by chance alone, within the database that was searched. 

Table 9B: BLASTN alignment results for AMF9 



Matching 

Entry (in 

GenBank 
Main) 


Be gin- 
End 


Description- 


Score 


£ 

Value 


gb:AL079305 
CNS00M8M 


[255- 
276] 


Human chromosome 14 DNA sequence *** IN PROGRESS 
*** BAC R-306B9 of library RPCM 1 from chromosome 14 
of Homo sapiens (Human), complete sequence. 


44.1 


0.059 
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gb:AP001729 
AP001729 


[219- 
240] 


Homo sapiens genomic DNA, chromosome 2 1 q, section 
73/105. 


44.1 


0.059 


gb:AP001436 
AP001436 


[219- 
240] 


Homo sapiens genomic DNA, chromosome 2 lq22. 2, 
cIone:T556, LB7T-CRG region, complete sequence. 


44.1 


0.059 


gb:AP0001 56 
AP0001 56 


[219- 
240] 


Homo sapiens genomic DNA, chromosome 21 q22 .2, DSCR 
region, clone D47-S479, segment 8/16, complete sequence. 


44.1 


0.059 


gb:AP000014 
AP000014 


[219- 
240] 


Homo sapiens genomic DNA of 21q22.2 Down Syndrome 
region, segment 7/13. 


44.1 


0.059 


gb:L21977 
PETAC02A 


[276- 
297] 


Petunia hybrida potential l-aminocyclopropane-l-carboxylate 
oxidase (AC02) pseudogene sequence. 


44.1 


0.059 



A BLASTP search was performed against public protein databases. The results from 
this comparison are shown in Table 9C. In both Table 9B and Table 9C ? as indicated by the 
fact that all resulting E values are higher than 0.001, no database entries were identified that 
had highly significant homologies to AMF9, i.e. y that at least one subject sequence within the 
public databases searched would have homology to the AMF9 Query sequence, due to chance 
alone, would be more frequent than 1 in 1000. 

Table 9C: BLASTP alignment results for AMF9 



Matching Entry 
(in SwissProt + 
SpTrEMBL) 


Begin 
-End 


Description 


Score 


E 

Value 


spt:Q62805 
GALR_RAT 


[2-64] 


GALANIN RECEPTOR TYPE 1 (GAL1-R) (GALR1). 


40.2 


0.003 


spt:P56479 
GALR_MOUSE 


[2-64] 


GALANIN RECEPTOR TYPE 1 (GAU-R) (GALR1). 


40.2 


0.003 


spt:P50391 
NY4R_HUMAN 


[4-63] 


NEUROPEPTIDE Y RECEPTOR TYPE 4 (NPY4-R) 
(PANCREATIC POLYPEPTIDERECEPTOR 1) (PP1). 


39.1 


0.008 


spt:Q9Z2D4 
Q9Z2D4 


[4-63] 


PANCREATIC POLYPEPTIDE RECEPTOR Y4. 


39.1 


0.008 


spt:Q61041 
NY4R_MOUSE 


[4-63] 


NEUROPEPTIDE Y RECEPTOR TYPE 4 (NPY4-R) 
(PANCREATIC POLYPEPTIDERECEPTOR 1)(PP1) 
(NPYR-D). 


37.9 


0.017 


spt:073734 
073734 


[2-64] 


NEUROPEPTIDE Y/PEPTIDE YY RECEPTOR YC. 


37.5 


0.023 


spt:O97505 
097505 


[4-63] 


NEUROPEPTIDE Y RECEPTOR TYPE 4. 


37.5 


0.023 


spt:Q22995 
Q22995 


[3-62] 


SIMILAR TO FAMILY 1 OF G-PROTEIN COUPLED 
RECEPTORS. 


37.5 


0.023 

1 



10 



15 



20 



For example, as shown in Table 9D, the AMF9 protein has 18 of 63 amino acid 
residues (29 %) identical to, and 33 of 63 residues (52 %) positive with, the 346 amino acid 
residue long rat galanin receptor type 1 (SEQ ID NO: 83). 

Table 9D. BLASTP of AMF9 against rat galanin receptor type 1 (SEQ ID NO:83) 

GALR_RAT rattus norvegicus (rat) . galaniji receptor type 1 (gali-r) (galrl) . 7/1998 
Length = 346, Score = 40.2, bits (92.0), Expect = 0.003 
Identities = 18/63 (29%), Positives = 33/63, (52%) 

Query: 2 LGVVWIjVAVIVGSPMWHVQQLEIKYDFIjYEK^ 61 

+1 +1 +++ + 11+ + |+| i + 111+ +1 + 1 I I +1 I 

Sbjct: 155 VGFI WAL.S I AMAS ? VAYYQFLL FHRD SNQT FCWE HW PNQLHKKA YVVCT FVFG YLL 209 
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Query: 62 PL.M 64 

11 + 

Sbjct: 210 PLL 212 

The nucleic acids and proteins of AMF9 are useful in potential therapeutic applications 
implicated in various AMF-related pathologies and/or disorders. For example, a cDNA 
encoding the GPCR-like protein may be useful in gene therapy^ and the GPCR-like protein 
may be useful when administered. to a subject in need thereof. The novel nucleic acid 
encoding AMF9 protein, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid ox the protein are tabe assessed. These 
materials are further useful in the generation of antibodies that bind immunospecifically to the 
novel substances, of the invention for use in therapeutic or diagnostic methods. 

/ The AMFX nucleic acids, and proteins are.usefiil in potential diagnostic and therapeutic 
applications implicated in - various .diseases and disorders described below- and/or other, 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from cancer and other cell proliferative disorders. By way of 
nonlimiting example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from cancer and other cell proliferative disorders. Additional AMF- 
related diseases and disorders are mentioned throughout the. Specification. 

Further, the protein similarity information, expression pattern, and map location for 
AMF9 suggests that AMF9 may have important structural and/or physiological functions 
characteristic; of the AMF family. Therefore, the nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific 6i selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic aeidor the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel AMF9 substances for use in therapeutic or diagnostic 
methods/These antibodies may be generated according to. methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Antir AMFX Antibodies" section 
below. In various embodiments, contemplated AMF9 epitopes are hydrophilic regions of the 
AMF9 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 
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AMF-10 (also referred to as Acc. No. G55707_A) 

AMF10 is a novel growth/differentiation factor-6-like polypeptide. The AMF10 clone 
is alternatively referred to herein Acc. No. G55707 A The AMF10 nucleic acid (SEQ ID 
NO: 19) of 1425 nucleotides is shown in Table 9A. The AMF10 open reading frame ( :t ORP") 
5 (SEQ ID NO:20) begins at nucleotide 3 1. The AMF 1 0 ORF terminates at a TAG codon at 
nucleotides 1396-1398. The AMF 10 protein was predict to be a secreted protein. The 
program Signal? predicts a signal peptide with the most likely cleavage site between amino 
acids 22 and 23. The predicted molecular weight of the AMF 10 polypeptide is 50677 Da. 

Table 10A: AMF-10 DNA (SEQ ID NO:19) and Polypeptide (SEQ ED NO;20) 

10 CTC CTG GGG AGA CGC AGC CAC TTG CCC GCC ATG GAT ACT CCC AGG 4 5 

■ Met Asp Thr Pro Arg 



15 



30 



45 



GTC CTG CTC TCG GCC GTC TTC CTC ATC AGT TTT CTG TGG GAT TTG 90 
Val Leu Leu Ser Ala Val Phe Leu lie Ser Phe Leu Trp Asp Leu 

CCC GGT TTC CAG CAG GCT TCC ATC TCA TCC TCC TGT TCG TCC GCC 135 
Pro Gly Phe Gin Gin Ala Ser lie. Ser Ser Ser Cys Ser Ser Ala 



GAG CTG GGT TCC ACC AAG GGC ATG CGA AGC CGC AAG GAA GGC AAG 180 
20 Glu Leu Gly Ser Thr Lys Gly Met Arg Ser Arg Lys Glu Gly Lys 

ATG CAG CGG GCG CCG CGC GAC AGT GAC GCG GGC CGG GAG GGC CAG 225 
Met Gin Arg Ala Pro Arg Asp Ser Asp Ala Gly Arg Glu Gly Gin 

25 GAA CCA CAG CCG CGG CCT CAG GAC GAA CCC CGG GCT CAG CAG CCC 270 

Glu Pro Gin Pro Arg Pro Gin Asp Glu Pro Arg Ala Gin Gin Pro 



CGG GCG CAG GAG CCG CCA GGC AGG GGT CCG CGC GTG GTG CCC CAC 315 
Arg Ala Gin Glu Pro Pro Gly Arg Gly Pro Arg Val Val Pro His 

GAG TAC ATG CTG TCA ATC TAC AGG ACT TAC TCC ATC GCT GAG AAG 350 
Glu Tyr Met Leu Ser lie Tyr Arg Thr Tyr Ser lie Ala Glu Lys 



CTG GGC ATC AAT GCC AGC TTT TTC CAG TCT TCC AAG TCG GCT AAT 405 
35 Leu Gly He Asn Ala Ser Phe Phe Gin Ser Ser Lys Ser Ala Asn - 

ACG ATC ACC AGC TTT GTA GAC AGG GGA CTA GAC GAT CTC TCG CAC 4 50 
Thr He Thr Ser Phe Val Asp Arg Gly Leu Asp Asp Leu Ser His 

40 ACT CCT CTC CGG AGA CAG AAG TAT TTG TTT GAT GTG TCC ATG CTC 495 

Thr Pro Leu Arg Arg Gin Lys Tyr Leu Phe Asp Val Ser Met Leu 



TCA GAC AAA GAA GAG CTG GTG GGC GCG GAG CTG CGG CTC TTT CGC 54 0 
Ser Asp Lys Glu Glu Leu Val .Gly Ala Glu Leu Arg Leu Phe Arg 

CAG GCG CCC TCA GCG CCC TGG GGG CCA CCA GCC GGG CCG CTC CAC 58 5 
Gin Ala Pro Ser Ala Pro Trp Gly Pro Pro Ala Gly Pro Leu His 



GTG CAG CTC TTC CCT TGC CTT TCG CCC CTA GTG CTG GAC GCG CGG 63 0 
50 Val Gin Leu Phe Pro Cys Leu Ser Pro Leu Leu Leu Asp Ala Arg 

ACC CTG GAC CCG CAG GGG GCG CCG CCG GCC GGC TGG GAA GTC TTC 6 75 
Thr Leu Asp Pro Gin Gly Ala Pro Pro Ala Gly Trp Glu Val Phe 

55 GAC GTG TGG CAG GGC CTG CGC CAC CAG CCC TGG AAG CAG CTG TGC 72 0 

Asp Val Trp Gin Gly Leu Arg His Gin Pro Trp Lys Gin Leu Cys 

TTG GAG CTG CGG GCC GCA TGG GGC GAG CTG GAC GCC GGG GAG GCC 765 
Leu Glu Leu Arg Ala Ala Trp Gly Glu Leu Asp Ala Gly Glu Ala 
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GAG GCG CGC GCG CGG GGA CCC CAG CAA CCG CCG CCC CCG GAC CTG 
Glu Ala Arg Ala Arg Gly Pro Gin Gin Pro Pro Pro Pro Asp Leu 



810 



CGG AGT CTG GGC TTC GGC CGC AGG GTG CGG CCT CCC CAG GAG CGG 855 
Arg Ser Leu Gly Phe Gly Arg Arg Val Arg Pro Pro Gin Glu Arg 

GCC CTG CTG GTG GTA TTC ACC AGA TCC CAG CGC AAG AAC CTG TTC 900 
Ala Leu Leu Val Val Phe Thr Arg Ser Gin Arg Lys Asn Leu Phe 

GCA GAG ATG- CGC GAG CAG CTG GGC TCG GCC GAG GCT GCG. GGC CCG £45 
Ala Glu Met Arg Glu Gin Leu Gly Ser Ala Glu Ala Ala Gly Pro 

GGC GCG GGC GCC GAG GGG TCG TGG CCG CCG CCG TCG GGC GCC CCG' 990 
Gly Ala Gly . Ala Glu Gly Ser, Trp Pro Pro. Pro Ser Gly Ala Pro 

GAT GCC AGG CCT TGG CTG CCC TCG CCC GGC CGC CGG CGG CGG CGC 1035 
Asp Ala 1 Arg Pro Trp Leu Pro Ser Pro Gly Arg Arg Arg Arg Arg 

ACG GCC TTC GCC AGT C(5£' "CaV GGC AAG CGG CAC GGC" AAG "AAG' TCC 1080 
■Thr Ala. Phe .Ala. Ser Arg .His, Gly Lys Arg His Gly Lys Lys Ser 

; AGG-CTA CGC TGC .AGC AAG AAG CCC CTG CAC GTG AAC TTC AAG GAG - 112S ~ 
Arg' Leu Arg Cys J Ser Lys Lys : Pro Leu His Val Asn Phe Lys Glu ' 

CTG GGC TGG GAC GAC TGG ~ATT ~ ATC G CGf CCC CTG GA3G TAC' GAG" GCC' ' " ' 1 1 7 0 : ' 
. Leu Qly-JTrp Asp Asp Trp lie lie Ala Pro Leu Glu Tyr Glu Ala 

TAT CAC TGC GAG GGT GTA TGC- GAC -TTC CCG CTG CGC TCG CAC CTG 1215 

Tyr His Cys Glu Gly* Val Cys Asp : pfie Pro Leu Arg Ser 1 His " Leu < < 

GAG CCC ACC AAC ; CAC GCC ATO ATC CAG ACG CTG ATG AAC TCG ATG 1260 
Glu Pro Thr Asn His. Ala. lie lie Gin Thr Leu Met Asm Ser Met 

GAC , CC T C , GC ? c: . TCC ACC CCG CCC AGC TGC TGC GTG CCC ACC AAA TTG 1305 

Asp Pro'Giy Ser Thr Pro Pro Ser Cys Cys Val Pro Thr Lys Leu 

ACT CCC ATC AGC ATT CTA TAC ATC GAC GCG GGC AAT AAT GTG GTC 1350 
Thr Pro lie Ser lie Leu Tyr lie Asp Ala Gly Asn Asn Val Val 

TAC AAG CAG TAC GAG GAC ATG GTG GTG GAG TCG TGC GGC TGC AGG 1395 
Tyr Lys :Gln- Tyr ! Glu-' Asp Met Val Val Glu Ser Cys Gly Cys" Arg 

TAG CGG TGC- CTT' -TCC "CGC CGC -CTT GGC. CCG" - . 1425 

In an analysis of public nucleic acid sequence databases, it was found, for example, that 
the AMF10 nucleic acid sequence has 95/98 bases (96%) identical to bos taurus cartilage- 
derived morphogenic protein 2 (GenBank Acc. No. BTU13661) (SEQ ID NO:84) shown in 
Table 1 OB. In all BLAST alignments herein, the "E-yalue" or "Expect" value is a numeric 
indication of the probability that the aligned sequences could have achieved their similarity to 
the BLAST query sequence by chance alone, within the database that was searched. 

Table 10B. BLASTN of AMF10 against CDMP 2 (SEQ ID NO:84) 

>BTU13661 U13661 Bos taurus cartilage-derived morpho gene tic protein 2 (CDMP-2) niRNA, 
complete cds. 1/1995, Length = 1308; Strand = Plus / Plus 

Score =. 170 bits (86) , Expect = 8e-41 

Identities =^95/98 (96%) 

Query: 3 gacttactccatcgctgagaagctgggcatcaatgccagctttttccagtcttccaagtc 62 

ii i in. hi in i ii .iiiiiiiiiiiiiiii ni i iniiiiiiiii iiimimiii 

Sbjct: 234 gacttactccatcgccgagaagctgggcatcaatgctagctttttccagtcttccaagtc 293 
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Query: 63 ggctaatacgatcaccagctttgtagacaggggactag 100 

lllllllllllllll lillllllllllMIIIIIIII 

Sbjct: 294 ggctaatacgatcactagctttgtagacaggggactag 331 

5 

Additional BLASTN information for related nucleic acid sequences is shown in Table 

IOC. 

Table IOC 

Score E 

10 Sequences producing significant alignments: (bits) Value 

BTU13661 U13661 Bos taurus cart ilage- derived morphogenetic . . . 170 8e-41 
AC058786 AC058786 Mus musculus clone RP23-117o7, complete ... 151 7e-35 
AF155125 AFi55125 Xenopus laevis growth and differentiation. 56 3e-06 

15 

A BLASTP search was performed against public protein databases. The result from 
this comparison are shown in Tables 10D. As shown in Table 10D, the AMF10 protein has 
354 of 435 amino acid residues (81%) identical to, and 372 of 435 residues (85 %) positive 
with, the 436 amino acid residue long bos taurus growth and differentiation factor 6 precursor. 
20 (Acc.No. P55106) (SEQIDNO:85). 

Table 10D. BLASTP of AMF10 against GDF 6 precursor (SEQ ID NO:85) 

>ptnr:SWISSPROT-ACC:P55106 GROWTH/DIFFERENTIATION FACTOR 6 PRECURSOR (GDF- 
6) (CARTILAGE -DERIVED MORPHOGENETIC PROTEIN 2) (CDMP-2) - Bos taurus (Bovine), 436 aa 
(fragment) . Length .= 436 
25 Score = 1795 (631.9 bits), Expect = 6.3e-185, P = 6.3e-185 . 

Identities = 354/435 (81%) , Positives = 372/435 (85%) 

Query: 33 SSAELGSTKGMRSRKEGKMQRAPRDSDAGREG QEPQPRPQDEPRA QQPRAQEPP 86 

+ IIIIM IIM + IMM MM++ II Mi 11.11 + 1*1+ III MM 

Sbjct: 2 ASAELGSAKGMRTRKEGRMPRAPRENATAREPLDRQEPPPRPQEEPQRRPPQQPEAREPP 61 

Query: 87 GRGPRWPHEYMLS I YRTYS IAEKLG INAS FFQSSKSANTITSFVDRGI*ODIiSHTPLRRQ 14 6 

MMI + MIMIMMMMMMIMMIMMMMMMMMMMIMMNN" 

Sbjct: 62 GRGPRLVPHEYMLSIYRTYSIAEKLGINASFFQSSKSANTITSFVDRGLDDLSHTPLRRQ 121 

Query: 147 KYLFDVSMLSDKEELVGAELRLFRQAPSAPWGPPAGPLHVQLFPCLSPLLLDARTLDPQG 2 06 

lllllll MIIIMMI++HIMM + I I I t.l I ++I | II 

Sbjct: 122 KYL FDVS TLS DKEELVGAD VRL FRQAPAALAP PAAAP LAALRLP - VAPAAGS AE P - GPAG 179 



35 



40 Query: 2 07 APPAGWEVFDVWQGLRHQPWKQLCLELRAAWG-SLDAGEAEARARGPQQPPPPDLRSLGF 2 65 

II II II II ll + l II III I III 1 1 II MM I I I III MINIM 1 1 llll I 

Sbjct: 130 APRPGWEVFDVWRGLRPQPWKQLCLELiRAAWGGEPGAAEDEARTPGPQQPPPPDl.RSLGF 23 9 
Query: 266 GRRVRPPQERALLVVFTRSQRKNLFAEMREQLGSA-EAAGPGAGAEGSWPPP S 317 

45 Mill 1 1 1 1 1 1 1 1 1 1 - 1 1 1 1 1 1 1 1 1 f 1 1 1 : 1 1 1 I Ml Mill Ml I 

Sbjct: 240 GRRVRTPQERALLWFSRSQRKTLFAEMREQLGSATEWGPGGGAEGSGPPPPPPPPPPS 299 
Query: 318 GAPDARPWLPSPGRRRRRTAFASRHGKRHGKKSRLRCSKKPLHVNFKELGWDDWIIAPLE 377 

, A llll I 1 1 II 1 1 1 1 1 1 1 1 II II 1 1 1 II M 1 1 1 1 1 II I II II II 1 1 1 II I M M M I 

3U Sbjct: 300 GTPDAGLWSPSPGRRRR-TAFASRHGKRHGKKSRLRCSKKPIiHVNFPCELGWDDWIIAPLE 358 
Query: 378 YEAYHCEGVCDFPIJISHLEPTNHAIIQTLMNSMDPGSTPPSCCVPTKLTPISILYIDAGN 437 

II; II MMI IN M I 111 1 1 MM Ml I MMMIMr I II I I Hi IIIIIMMII i 

Sbjct: 359 YEAYHCEGVCDFPIJ?SHIjEPTNHAIIQTIjMNSMDPGSTPPSCCVPTKLT?ISILYIDAGN 418 



55 



Query: 43 8 NWYKQYEDMWESCGCR 4 55 

MM +II + IIIIMIII 

Sbjct: 419 NWYNEYEEMWESCGCR 43 6 
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Expression information for AMFX RNA was derived using tissue sources including, 
but not limited to, proprietary database sources, public EST sources, literature sources, and/or 
RACE sources, as described in the Examples. AMF10 is expressed in at least, e.g., astrocytoma 
and glioma derived tissue. The nucleic acids and proteins of AMF10 are-useful in potential 
therapeutic applications implicated in various AMF-related pathologies and/or disorders. For . 
example, a cDNA encoding the growth/differentiation factor-6-like protein may be useful in 
gene therapy, and the growth/differentiation factor-6-like protein may be useful when 
administered to a subject in need thereof. The novel. nucleic acid encoding AMF10 protein, or 
fragments thereof, may further: be useful in diagnostic apphcati<ms^ wheisin thecpresence or 
amount of the nucleic acid or the protein are to be assessed; These materials arc: further useful 
in the generation of antibodies^thatbhid inunm^ of the 

invention for use in therapeutiaor diagnostic methodsL h .- : — 

The AMFX nucleic acids and proteins are useful-ia potential diagnostic and therapeutic 
applications implicated inr various diseases and disorders described below and/or, other 
pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from cancer and other ceU pmliferative disorders. By way of 
nonlimiting example, the compositions of the:present invention will have efficacy for treatment 
of patients suffering from cancer and other cell proliferative disorders; Additional AMF- 
related diseases and disorders are mentioned throughoutthe Specification. : 

Further, the protein similarity information, expression pattern, and map. location for 
AMF10 suggests that AMF10 may have important structural and/or physiological functions 
characteristic of the AMF family. Therefore* the nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications and as a research tool. These include 
serving as a specific or selective nucleic acid or protein diagnostic and/or prognostic marker, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a small 
molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a 
composition promoting tissue regeneration in vitro and in vivo (vi) biological defense weapon. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel AMF 10 substances for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-AMFX Antibodies" section 
below. In various embodiments, contemplatedAMFlO epitopes are hydrophilic regions of the 
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AMF10 polypeptide as predicted by software programs well known in the art that generate 
hydrophobicity or hydrophilicity plots. 

AMFX Nucleic Acids and Polypeptides 

Novel AMFX nucleic acid and polypeptide sequences disclosed in the invention 
5 include those summarized in Table 11. 

Table 11. AMFX Sequences and Corresponding SEQ ID Numbers 



AMFX 
No. 


Internal Identification 


SEQ ID NO 
(nucleic acid) 


SEQ ID NO 
(polypeptide) 


Homology 


1 


14209510 


1 


2 


Fibrillin 2 precursor 


2 


20421338 


3 


4 


Nephrin 


3 


27251385 


5 


6 


Fibrillin 2 precursor 


4 


27486474 


7 


8 


Plasminogen 


5 


29691387 


9 


10 


Organic Anion Transporter 


6 


12996895_1 


11 - 


12 


MEGF6 


7 


38905521 


13 


14 


IL-11 


8 


AC11036_A 


15 


16 


Pleiotrophin 


9 


AL307658 


17 


18 


GPCR13 


10 


GMG55707 EXT.0.1 
dal 


19 


20 


GDF6 



One aspect of the invention pertains to isolated nucleic acid molecules that encode 
AMFX polypeptides or biologically-active portions thereof. Also included in the invention are 

10 nucleic acid fragments sufficient for use as hybridization probes to identify AMFX-encbding 
nucleic acids (e.g. , AMFX mKNAs) and fragments for use as PCR primers for the 
amplification and/or mutation of AMFX nucleic acid molecules. As used herein, the term 
"nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA), 
RNA molecules (e.g., rriRNA), analogs of the DNA or RNA generated using nucleotide 

1 5 analogs, and derivatives, fragments and homologs thereof. The nucleic acid molecule may be 
single-stranded or double-stranded, but preferably is comprised double-stranded DNA. 

An AMFX nucleic acid can encode a mature AMFX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 

20 polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full length 
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
"mature'* form arises, again by way of nonlimiting example, as a result of one or more 
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naturally occurring processing steps as they may take place within the cell, or host cell, in 
which the gene product arises. Examples of such processing steps leading to a "mature" form 
of a polypeptide or protein include the cleavage of the N-terminal methionine residue encoded 
by the initiation codon of an ORF, or the proteolytic cleavage of a signal peptide or leader 
sequence. Thus a mature form arising from a preciirsor polypeptide or protein that has residues 
1 to N, where-residue 1 is the N-terminal methionine, would have residues 2 through N 
remaining after removal of the N-terminal methionine/ Alternatively, a mature form arising 
from a precursor polypeptide or protein having residues 1 to N; in which an N-terminal signal 
sequence from residue t to residue M is cleaved, would have the residues from residue M+l to 
residue N remaining. Further as used herein, a "mature" fonn of a polypeptide or protein may 
arise from a step ofpost-trianslational modification other than a proteolytic cleavage event. 
Such additional processes include, by:w^ - 

myristoylkion or pho'sphoiy^ poT^feptide df protein may result from 

the operation^ofKraly 

The temf "probes", as utilized herein, refers to nucleic acid sequences of variable 
length, preferably between at least about 10 nucleotides (nt), 100 ht, or as many as 
approximately; e.g. , 6,000 nt, depending upoii the specific use. Probes are used in the 
detection of identical; similar, or complementary nueleie acid sequences. Longer length probes 
are generally obtained from a natural or recombinant source, are highly specific, and much 
slower to hybridize thm stibrt^^ Probes may be single- or double- 

stranded and designed to have specificity in PCR; membrane-based hybridization technologies, 
or ELISA-lifce technologies. 1 : - - 

The term "isolated^ nucleic acid molecule, as utilized herein, is one which is separated 
fromother nucleic acid molecules Which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic 
acid (z.e. , sequences located at the 5 - and 3Mermirii of the nucleic acid) in the genomic DNA 
of the organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated AMFX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 
kbi 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the "cell/tissue from which the nucleic acid is derived {e.g., brain, heart, liver, 
spleen, etc.); Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by recombinant 
techniques, or of chemical precursors or other chemicals when chemically synthesized. : 
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A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ IDNOS:l,3 ? 5, 7, 9, 11, 13, 15, 17 and 19, or a complement^ 
this aforementioned nucleotide sequence, can be isolated using standard molecular biology 
techniques and the sequence information provided herein. Using all or a portion of the nucleic 
5 acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 19 as a hybridization probe, 
AMFX molecules can be isolated using standard hybridization and cloning techniques (e.g., as 
described in Sambrook, et aL, (eds.), Molecular Cloning: A Laboratory Manual 2 nd Ed., 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al 7 
(eds.), Current Protocols in Molecular BioloqV, John Wiley & Sons, New York, NY, 
10 1993.) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate 
vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides 

1 5 corresponding to AMFX nucleotide sequences can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a 
PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 

20 genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or 
1 00 nt in length, preferably about 1 5 nt to 30 nt in length. In one embodiment of the invention, 
an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length would further 

25 comprise at least 6 contiguous nucleotides of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19, 
or a complement thereof. Oligonucleotides may be chemically synthesized and may also be 
used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID 
30 NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19, or a portion of this nucleotide sequence (e.g., a 

fragment that can be used as a probe or primer or a fragment encoding a biologically-active 
portion of an AMFX polypeptide). A nucleic acid molecule that is complementary to the 
nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17and 19, is one that is 
sufficiently complementary to the nucleotide sequence shown in SEQ ID NOS. l, 3, 5, 7, 9, 1 1, 
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13, 15, 17 and 19, that it can hydrogen bond with little or no mismatches to the nucleotide 
sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19, thereby forming a stable 
duplex. 

As used herein, the. term "complementary" refers to Watson-Crick or Hoogsteen base 
5 pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van 
der Waals, hydrophobic interactions^ and the like, ^^bysical. interaction can.be either direct or 
indirect Indirect interactions may be through or due ;to the; effects of another polypeptide or 

10 compound; Direct binding refers to interactions that do not take pjace through^ or due to, the 
effect of another polypeptide or compound* but instead are without othe^ substantial chemical 
intermediates... , yi: 7 A ric.r..:vr.} n;^:^:^^ - :: . ;nc ^ ;v« - 

; : Fi^merit^pravided herein tare defined as sequebcss of atte^t-6(cpntiguoiis) nucleic 
acids or at least 4 (contiguous) iamiha acids* a length sufficient tQ:alld\y for specific 

15 hybridization in the case of nucleic, acids or for specific recognition of an epitope in the case of 
amino acids, respectively, and are at mostsome portion less tiia&afulL length sequence. 
Fragments may be derived from anyl contiguous portion of a nucleic acid or amino acid 
sequenceof choice. . Derivatives are: nucleic acidsequences or-amino:a^id sequences fomied 
from the native compounds either directly orby modification or partial substitution. Analogs 

20 are nucleic acid sequences or amino acid sequences -that hatver a structure similar to r but not 
identical to, the native compound but differs from it in respect to certaia components or side 
chains. Analogs may be synthetic or from a different evolutionary origin andmay have a : 
similar or opposite metabolic activity compared to wild typei JHLomologs are- nucleic acid 
sequences or amino acid sequences of a particular gene that are derived from different species. 

25 Derivatives and analogs may be full length or other than full length, if the derivative or 

analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, 
molecules comprising regions that are substantially homologous to the nucleic acids or proteins 
of the invention, in various embodiments, by at least about: 70%, 80%, or. 95% identity (with a 

30 preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
when compared to an aligned sequence in which the alignment is done by a computer 
homology program known in the art, or whose encoding nucleic acid is capable of hybridizing 
to the complement of a sequence encoding the aforementioned proteins under stringent, 
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moderately stringent, or low stringent conditions. See e.g. Ausubel, et ah, Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1 993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
5 amino acid level as discussed above. Homologous nucleotide sequences encode those 

sequences coding for isoforms of AMFX polypeptides. Isoforms can be expressed in different 
tissues of the same organism as a result of, for example, alternative splicing of RNA. 
Alternatively, isoforms can be encoded by different genes. In the invention, homologous 
nucleotide sequences include nucleotide sequences encoding for an AMFX polypeptide of 

10 species other than humans, including, but not limited to: vertebrates, and thus can include, e.g., 
frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide 
sequences also include, but are not limited to, naturally occurring allelic variations and 
mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence 
does not, however, include the exact nucleotide sequence encoding human AMFX protein. 

1 5 Homologous nucleic acid sequences include those nucleic acid sequences that encode 

conservative amino acid substitutions (see below) in SEQ ID NOS :2, 4, 6, 8, 1 0, 1 2, 14,-16, 18 
and 20, as well as a polypeptide possessing AMFX biological activity. Various biological 
activities of the AMFX proteins are described below. ^ 

An AMFX polypeptide is encoded by the open reading frame ("ORF") of an AMFX 

20 nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or TGA. 
For the purposes of this invention, an ORF may be any part of a coding sequence, with or 

25 without a start codon, a stop codon, or both. For an ORF to be considered as a good candidate 
for coding for a bona fide cellular protein, a minimum size requirement is often set, e.g., a 
stretch of DNA that would encode a protein of 50 amino acids or more. 

The nucleotide sequences determined from the cloning of the human AMFX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 

30 AMFX homologues in other cell types, e.g. from other tissues, as well as AMFX hpmologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 
or 400 consecutive sense strand nucleotide sequence of SEQ IDNOS:l,3, 5, 7, 9, 11, 13, 15, 
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17 and 19; or an anti-sense strand nucleotide sequence of SEQ IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 
17 and 19; or of a naturally occurring mutant of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 
19. 

Probes based on the human AMFX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g. the label group 
can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such 
probes can be used as a part of a diagnostic test kit for identifying cells or tissues which mis- 
express an AMFX protein, such as by measuring a level of an AMFX-encoding nucleic acid in 
a sample of cells from a subject e.g., detecting AMFX mRNA levels or determining whether a 
genomic AMFX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an AMFX polypeptide" refers to 
polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
active portion of AMFX" can be prepared by isolating a portion of SEQ ID NOS: 1 , 3, 5, 7, 9, 
11, 13, 15, 17 and 19, that encodes a polypeptide having an AMFX biological activity (the 
biological activities of the AMFX proteins are described below), expressing the encoded 
portion of AMFX protein (e.g., by recombinant expression in vitro) and assessing the activity 
of the encoded portion of AMFX. 

AMFX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NOS: 1,3, 5, 7, 9, 1 1, 13, 15, 17 and 19, due to 
degeneracy of the genetic code and thus encode the same AMFX proteins as that encoded by 
the nucleotide sequences shown in SEQ ID NO NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19. In 
another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide 
sequence encoding a protein having an amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18 and 20. 

In addition to the human AMFX nucleotide sequences shown in SEQ ID NOS:l, 3, 5, 
7, 9, 1 1, 13, 15, 17 and 19, it will be appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid sequences of the AMFX polypeptides 
may exist within a population (e.g., the human population). Such genetic polymorphism in the 
AMFX genes may exist among individuals within a population due to natural allelic variation. 
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As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame (ORF) encoding an AMFX protein, preferably a vertebrate 
AMFX protein. Such natural allelic variations can typically result in 1-5% variance in the 
nucleotide sequence of the AMFX genes. Any and all such nucleotide variations and resulting 
5 amino acid polymorphisms in the AMFX polypeptides, which are the result of natural allelic 
variation and that do not alter the functional activity of the AMFX polypeptides, are intended 
to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding AMFX proteins from other species, and 
thus that have a nucleotide sequence that differs from the human sequence of SEQ ID NOS: 1 , 

10 3, 5, 7, 9, 11, 13, 15, 17 and 19, are intended to be within the scope ofthe invention. Nucleic 
acid molecules corresponding to natural allelic variants and homologues of the AMFX cDNAs 
of the invention can be isolated based on their homology to the human AMFX nucleic acids 
disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe, 
according to standard hybridization techniques under stringent hybridization conditions: 

15 Accordingly, in another embodiment, an isolated nucleic acid molecule ofthe invention 

is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13, 15, 17 and 
19. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 500, 750, 1000, 
1500, or 2000 or more nucleotides in length. In yet another embodiment, an isolated nucleic 

20 acid molecule of the invention hybridizes to the coding region. As used herein, the term 

"hybridizes under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 60% homologous to each other typically 
remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding AMFX proteins derived from species other than 

25 human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or high 

stringency hybridization with all or a portion of the particular human sequence as a probe using 
methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no 

30 other sequences. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures than shorter 
sequences. Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of 
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the probes complementary to the target sequence hybridize to the target sequence at 
equilibrium. Since the target sequences are generally present at excess, at Tm, 50% of the 
probes are occupied at equilibrium. Typically, stringent conditions will be those in which the 
salt concentration is less than about 1 .0 M sodium ion^ typically about 0.01 to 1 .0 M sodium 
5 ion (or other salts) at 

pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides {e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 
agents, such as forinamide: r : 
10- „ . Stringent conditions are known to those s^ 

- e l Q h (?^;)^V!^ OT Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. Preferably, the conditions are such that sequences, at least about 65%,,70%, 75%, 
$5% 90%, 95%, 98% y or 99% homologousto each otter t^ically .rei^rthybri 
other. A non-limitirig example of stringent hybridization conditions are hybridiiatiptfm a high 
^ 15 salt buffer comprising* 6XSS(% 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, b.02% 

Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 
^ more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolatedJiueleie acid molecule of the 

^ - invention that hybridizes under stririgeht conditions to the sequences of SEQ ID NOS : 1 , 3, 5, 

'Z^z 7^ 9, 11,13, ISy l/Z and 19, corresponds ta^a nalxiraUy^ccurTing nucleic acid molecule. As 

v ^ 20 used herein, a "naftu^yHSccumng" nucleic acid molecule refers to an RNA or DNA molecule 
% . having a nucleotide sequence that occurs in nature (e.g , encodes a natural protein). 
: S1 In * second ©mbodknent^ : a nucleic acid sequence that is hybridizable to the nucleic acid 

q- molecule comprising the nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 
-S: . 1 9i or fiagments, analogs or derivatives thereof, under conditions of moderate stringency is 

25 provided. A non-limiting example of moderate stringency hybridization conditions are 
^ hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS and 100 mg/ml denatured salmon 

sperm DNA at 55°C, followed by one or more washes in IX SSC, 0. 1 % SDS at 37°C. Other 
conditions of moderate stringency that may be used are well-known within the art. See, e.g., 
1 Ausubel, et al (eds.), 1 993, Current Protocols in Molecular Biology, John Wiley & 
30 Sons, NY, and Kriegler, 1 990; Gene Transfer and Expression, A Laboratory Manual, 

J _ Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 
comprising the nucleotide sequences of SEQ ID NOS:l, 3, 5, 7, 9, 1 1,13, 15, 17 and 19, or 
fragments, analogs or derivatives thereof, under conditions of low stringency, is provided. A 
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non-limiting example of low stringency hybridization conditions are hybridization in 35% 
formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 40°C; 
followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 
5 0.1% SDS at 50°C. Other conditions of low stringency that may be used are well known in the 
art (e.g., as employed for cross-species hybridizations). See, e.g., Ausubel, et al. (eds.), 1993, 
Current Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1 990, 
Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; Shilo and 
Weinberg, 1981 . Proc Natl Acad Set USA 78: 6789-6792. 

10 Conservative Mutations 

In addition to naturally-occurring allelic variants of AMFX sequences that may exist in 
the population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into the nucleotide sequences of SEQ ID NO NOS:l, 3, 5, 7, 9, 11,13,15, 17 and 19, 
thereby leading to changes in the amino acid sequences of the encoded AMFX proteins; 

1 5 without altering the functional ability of said AMFX proteins. For example, nucleotide 

substitutions leading to amino acid substitutions at "non-essential" amino acid residues can be 
made in the sequence of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20. A "non-essential' 1 
amino acid residue is a residue that can be altered from the wild-type sequences of the AMFX 
proteins without altering their biological activity, whereas an "essential" amino acid residue is 

20 required for such biological activity. For example, amino acid residues that are conserved 
among the AMFX proteins of the invention are predicted to be particularly non-amenable to 
alteration. Amino acids for which conservative substitutions can be made are well-known 
within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding AMFX 
25 proteins that contain changes in amino acid residues that are not essential for activity. Such 
AMFX proteins differ in amino acid sequence from SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 
and 20, yet retain biological activity. In one embodiment, the isolated nucleic acid molecule 
comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino 
acid sequence at least about 45% homologous to the amino acid sequences of SEQ ID NOS:2, 
30 4, 6, 8, 10, 12, 14, 16, 18 and 20. Preferably, the protein encoded by the nucleic acid molecule 
is at least about 60% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20; more 
preferably at least about 70% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20; 
still more preferably at least about 80% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
1 8 and 20; even more preferably at least about 90% homologous to SEQ ID NOS:2, 4, 6, 8,10, 
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12, 14, 16, 18 and 20; and most preferably at least about 95% homologous to SEQ ID NOS.2, 
4,6,8, 10, 12, 14, 16, 18 and 20. 

An isolated nucleic acid molecule encoding an AMFX protein homologous to the 
protein of SEQ ID NOS:2, 4, ,6, 8, 10, 12, 14, 16, 18 and 20, can be created by introducing one 
, or more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID 
NOS:l, 3, 5, 7, 9, 1 1,. : 13, 15,17 and 19, such that one or more amino acid substitutions, 
additions or deletipns are introduced into the encoded protein. 

Mutations ^can J>e introduced tato SEQ ro.NOS:2, ; 4, 6, 8,, 10, 12, i 4, 16, 18 and 20, by 
standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are made at one or more predicted, 
non-essential amino acid residues. A "conservative amino acid substitution" is one in which 
the amino acid residue is replaced with an amino acid residue having a similar side chain. 
Families of anvino acid residues haying similar side chains have been defined within the art. 
These families include amino acids with basic side chains,(e.#., lysine, arginine, histidine), 
acidic side chains (e.g, aspartic acid, glutamic acid), uncharged polar side chains fee., glycine 
asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, ^ methionine, tryptophan), beta-branched side 

chains (e.g. , threonine, valine, isoleucine) and aromatic side chains JLf^g.^ tyrosine, 
phenylalanine, tryptophan, histidine). Thus, a p^icted ro^ amino acid residue in the 

AMFX protein is replaced with another amino acid residue from the same side chain family. 
Alternatively, in another embodiment, mutations can be introduced randomly along all or part 
of an AMFX coding sequence, such as by saturation mutagenesis, and the resultant mutants 
can be screened for AMFX biological activity to identify mutants that retain activity. 
Following mutagenesis of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20, the encoded 
protein can be expressed by. any recombinant technology known in the art and the activity of 
the protein can be determined. 

The relatedness of amino acid families may also be determined based on side chain 
interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one 
of the following: CSA, ATV, SAG, SINK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, 
VLIM, HFY, wherein the letters within each group represent the single letter amino acid code. 
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In one embodiment, a mutant AMFX protein can be assayed for (/) the ability to form 
protein:protein interactions with other AMFX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (//) complex formation between a mutant AMFX protein 
and an AMFX ligand; or (i if) the ability of a mutant AMFX protein to bind to an intracellular 
5 target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant AMFX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 

1 0 that are hybridizable to or complementary to the nucleic acid molecule comprising the 

nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 19, or fragments, analogs 
or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that is 
complementary to a "sense" nucleic acid encoding a protein (e.g., complementary to the coding 
strand of a double-stranded cDNA molecule or complementary to an mRNA sequence). In 

15 specific aspects, antisense nucleic acid molecules are provided that comprise a sequence 4 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire AMFX 
coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, A 
homologs, derivatives and analogs of an AMFX protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 
1 6, 1 8 and 20; or antisense nucleic acids complementary to an AMFX nucleic acid sequence of 

20 SEQ ID NOS: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7 and 1 9, are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an AMFX protein. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 

25 molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 

encoding the AMFX protein. The term "noncoding region" refers to 5 f and 3 ' sequences which 
flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' 
untranslated regions). 

Given the coding strand sequences encoding the AMFX protein disclosed herein, 

30 antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary 
to the entire coding region of AMFX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of AMFX mRNA. For example, 
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the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of AMFX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) 
can be chemically synthesized using haiui^y^ccurring nucleotides or variously modified 
nucleotides designed tb increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids (e.g., 
phosphorothioate derivatives and acridirie substituted nucleotides dan be used)'. 

Examplfesfof modified nucleotides that can be used to generate the antisense nucleic 
acid incliide: ;r 5-fl^rduracfl; 5-6romolirac^ 
xanthine, 4^acetyl^6sine^ 

2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isoperitenyfa^ 

2-ine^Iaderi^ 5-methylcytosine, N6-adenine, 

7-methy lguanine, 5-meffiylatoinomethy luracil, 5-methoxyaminomethy 1-2-thiouracil, 
beta-D-maimb^lqueosine^ 5-mefooxyiiraciC : 

2-methylthid-N6-isop^n^ (v), wybutbxosine" pseudouracll, 

queosine, 2-thiocytosine, 5-in^thyr-2^thidiiracil, 2-thiouracii; ^thioiiracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
3<3-aminoO-N-^&fo^^ Mernatively, the 

antisense nucleicacid can tte produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (z.e., RNA transcribed from the 
inserted nucleic aicid Will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of thelnventioh are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DN A encoding ah AMFX prbtehi to thereby inhibit expression of the protein (e.g., by 
inhibiting transcription ^d/or translation). The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense 
nucleic acid molecule that binds to DNA duplexes; through specific interactions in the major 
groove of the double helix. An example of a route of administration of antisense nucleic acid 
molecules of the invention includes direct injection at a tissue site. Alternatively, antisense 
nucleic acid molecules can be modified to target selected cells and then administered 
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systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface 
0 by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell 
surface receptors or antigens). The antisense nucleic acid molecules can also be delivered to 
5 cells using the vectors described herein. To achieve sufficient nucleic acid molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a strong 
pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
ot-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
10 double-stranded hybrids with complementary RNA in which, contrary to the usual p-units, the 
strands run parallel to each other. See, e.g., Gaultier, et aL 9 1987. Nucl. Acids Res. 1 5: 
6625-6641 . The antisense nucleic acid molecule can also comprise a 

2'-o-methylribonucleotide (see, e.g., Inoue, et al 1987. Nucl Acids Res. 15: 613 1-6148) or a 
chimeric RNA-DNA analogue (see, e.g., Inoue, et al, 1987. FEBS Lett. 215: 327-330. r 

15 Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, 
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 

20 therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving 
a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes {e.g., hammerhead ribozymes as described in Haselhoff and Gerlach 1988. 

25 Nature 334: 585-591) can be used to catalytically cleave AMFX mRNA transcripts to thereby 
inhibit translation of AMFX mRNA. A ribozyme having specificity for an AMFX-encoding 
nucleic acid can be designed based upon the nucleotide sequence of an AMFX cDNA 
disclosedherein(/.e.,SEQIDNOS:l,3,5, 7,9, 11, 13, 15, 17 and 19). Forexample,a 
derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide 

30 sequence of the active site is complementary to the nucleotide sequence to be cleaved in an 
AMFX-encoding mRNA. See, e.g., U.S. Patent 4,987,071 to Cech, et al. and U.S. Patent 
5,11 6,742 to Cech, et al. AMFX mRNA can also be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) 
Science 261:1411-1418. 
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Alternatively, AMFX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the AMFX nucleic acid (e.g., the AMFX 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
AMFX gene in target cells. See, e.g., Helene, 1991. Anticancer DrugDes. 6: 569-84; Helene, 
5 et all 992. Ann. MY. Acad. Set 660: 27-36; Maher, 1 992. Bioassays 14: 807-1 5. 

In various embodiments, the AMFX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility 
of the molecule. For example, the deoxyribose phos^te backbone of the nucleic acids can be 
modified to generate peptide nucleic acids. See, e.g:; Hyrup, et aL, 1996. BioorgMed Chem 4: 
1 0 5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics 
DNA miniics) in which the deoxyribose phosph^e backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained, Theneutrai 
backbone of PN As has been shown to allow for specific hybridization to E>NA and SNA under 
conditions of low ionic strength. The synthesis of PNA oligomers cin be performed using 
1 5 standard solid phase peptide synthesis protocols as described in Hyrup, et aL, 1996 supra; 
Perry-O'Keefe, etaL\ 1996. Proc. Natl Acad Set US 

PNAs of AMFX can be used in therapeutic and diagnostic application For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, e.g. , inducing transcription or translation arrest or inhibiting replication. PNAs 
20 of AMFX can also be used, for example, in the analyas of single base pair mutations in a gene 
(e.g. , PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other engines, e.g., Si nucleases (see, Hyrup, et al^ 1996^wpr<jr); or as probes or primers 
for DNA sequence and hybridization (see, Hyrup, et at, 1 996, supra; Peny-OTCeefe, et aL, 
\996Vsupray ~ ' ^ - 

25 in another embodiment, PNAs of AMFX can be modified, e.g. , to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art For example, PNA-DNA chimeras of AMFX can be generated that 
may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
30 recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the DNA portion 
while the PNA portion would provide high binding affinity and f specificity. PNA-DNA 
chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, 
number of bonds between the nucleobases, and orientation '(see, Hyrup, etal., 1996. supra). 
The synthesis of PNA-DNA chimeras can be performed as described in Hyrup, et al, 1996. 
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supra and Finn, et a/., 1996. Nucl Acids Res 24: 3357-3363. For example, a DNA chain can be 
synthesized on a solid support using standard phosphoramidite coupling chemistry, and r 
modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5 r -deoxy-thymidine 
phosphoramidite, can be used between the PNA and the 5 f end of DNA. See, e.g., Mag, et ciL, 
5 1 989. Nucl Acid Res 17: 5973-5988. PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 3 ? DNA segment. See, e.g., Finn, et 
ah, 1996. supra. Alternatively, chimeric molecules can be synthesized with a 5 1 DNA segment 
and a 3' PNA segment See, e.g., Petersen, et ah; 1975: Bioorg. MedChem. Lett 5: 
1119^11124. - • - •-.c-;^.-. • . . 

10 ; In other embodiments; the oligonucleotide may include other appended groups such as 

peptides (e.g. v for targeting host cell receptors in vivo),, or agents facilitating transport across 
the cell membrane {see, ^.v:Letsinger, er a/., 1989. Proc. Nail. Acad, ScL U.SIA, 86: 
6553-6556; Lemaitre^^^ Publication No. 

WG88/0981Q) or the btood-brain barrier (seet &g - PCT Publication No. WO 89/10134). In 

15 addition, oligonucleotides can. be: modified; with -hybridization triggered cleavage agents (see, 
e.g., Krol, et &/., 1988. Zon, 1988. 

Pharm. Res. 5: 539-549); To thi£ end* the oligonucleetideonay be conjugated to another 
molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a; 
hybridization-triggered cleavage agent, and the like..: ■ ■■ 

20 AMFX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino 
acid sequence of AMFX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6, 
8, 10, 12, 14, 16, 18 and_2Q. The invention also includes a mutant or variant protein any of 
whose residues may be changed from the corresponding residues shown in SEQ ID NOS;2, 4, 

25 6, 8, 10,- 12, 14, 16, 18 and 20, while still encoding a protein that maintains its AMFX activities 
and physiological functions, or a functional fragment thereof. . . 

In general, an AMFX variant that preserves AMFX-like function includes any variant 
in which residues at a particular position in the sequenoe.have been substituted by other amino 
acids* and further include the possibility of inserting an additional residue or residues between 

30 two residues of the parent protein as well as the possibility of deleting one or more residues 
from the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed 
by the invention. In favorable circumstances, the substitution, is a conservative substitution as 
defined above. 
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One aspect of the invention pertains to isolated AMFX proteins, and biologically-active 
portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided are 
polypeptide fragments suitable for use as immunogens to raise anti-AMFX antibodies. In one 
embodiment, native AMFX proteins can be isolated from cells or tissue sources by an 
5 appropriate purification scheme using standard protein purification techniques. In another 
embodiment, AMFX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an AMFX protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof 

10 is substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the AMFX protein is derived, or substantially free from chemical 
precursors or other chemicals when chemically synthesized. The language "substantially free 
of cellular material" includes preparations of AMFX proteins in which the protein is separated 
from cellular components of the cells from which it is isolated or recombinantly-produced. In 

15 one embodiment, the language "substantially free of cellular material" includes preparations of 
AMFX proteins having less than about 30% (by dry weight) of non- AMFX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non- AMFX proteins, still more preferably less than about 1 0% of non- AMFX proteins, and i 
most preferably less than about 5% of non-AMFX proteins. When the AMFX protein or 

20 biologically-active portion thereof is recombinantly-produced, it is also preferably substantially 
free of culture medium, i.e., culture medium represents less than about 20%, more preferably 
less than about 10%, and most preferably less than about 5% of the volume of the AMFX 
protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
25 preparations of AMFX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations 
of AMFX proteins having less than about 30% (by dry weight) of chemical precursors or 
non-AMFX chemicals, more preferably less than about 20% chemical precursors or 
30 non-AMFX chemicals, still more preferably less than about 10% chemical precursors or 
non-AMFX chemicals, and most preferably less than about 5% chemical precursors or 
non-AMFX chemicals. 

Biologically-active portions of AMFX proteins include peptides comprising amino acid 
sequences sufficiently homologous to or derived from the amino acid sequences of the AMFX 
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proteins (e.g., the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 
20) that include fewer amino acids than the fall-length AMFX proteins, and exhibit at least one 
activity of an AMFX protein. Typically, biologically-active portions comprise a domain or 
motif with at least one. activity of the AMFX protein. A biologically-active portion of an 
5 AMFX protein can be a polypeptide. which is, for example, 10, 25, 50, -100 or more amino acid < 
residues in length. . , : ■ --v 

Moreover^ other biologically-active portions, in which other regions of the protein are 
deleted; can be prepared by: recombinant techniques an&evaluated for orie or more of the 
functional activities of a native AMFX protein*-n / 

10 In an embodment, the AMEX^rotein has an amino acid sequence shown in SEQ ID 

NOS:2,4,6 ; 8^10^12^14, ^ : 
substantially homologous to SEQ IDNOS:2,' 4* 6, 8; 1 0 9 12; :14^ 1 6jrtS and 20, and retains the 
functional activity of the protein ofSEQ ID NOS:2- 4,:6?8, 10,12, 14^16, 1 a and 20, yet^: 
dififers in amino acid Sequence: due ta nature 

15 detail, below. Accordingly, ^ 

comprises an amino acid sequence atleast about 45%: homologous ta ttofe amino acid sequence 
of SEQ ID NOS:2, 4; 6, 8, 10,: 12, 14, 1 6, 1 8 and 20, and retains the functional activity of the 
AMFX proteins of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20. *■ 
Determining Homology Between Two or ' More Sequences r ; : ™ ;r 

20 To detenninethe percent homology, of two anino arid sequences- or of two nucleic : 

acids, the sequences^are aligned for optimal comparison purposes (egiy gaps caitbe introduced 
in the sequence of a first amino acid or. nucleic acid sequence for optimal ■ alignment with a 
second amino or nucleic acidsequence). This amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared When a _ 

25 position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence; then the molecules are homologous at that 
position (r;e;, as used herein amina acid or nucleic acid "homology" is equivalent to amino acid 

or nucleic acid "identity"). > • : . 

The nucleic acid sequence homology may be determined as the degree of identity 

30 between two sequences.. The homology may be determined using computer programs known 
in the art, such as GAP software provided in the GGG program package.: See, Needleman and 
Wunsch, 1970. J Mot Biol 48: 443-453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty 
of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a 
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degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with 
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOS: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 
17 and 19. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
5 polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of 
nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the 
1 0 number of matched positions by the total number of positions in the region of comparison (/. e. , 
the window size), and multiplying the result by 100 to yield the percentage of sequence 
identity; The term "substantial identity" as used herein denotes a characteristic of a 
polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 
percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent 
1 5 sequence identity, more usually at least 99 percent sequence identity as compared to a 
reference sequence over a comparison region. 
Chimeric and Fusion Proteins 

The invention also provides AMFX chimeric or fusion proteins. As used herein, an 
AMFX "chimeric protein" or "fusion protein" comprises an AMFX polypeptide operatively- 

20 linked to a non-AMFX polypeptide. An "AMFX polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to an AMFX protein (SEQ ID NOS:2, 4, 6, 8, 10, 12, 
14, 16, 18 and 20), whereas a "non-AMFX polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to a protein that is not substantially homologous to the 
AMFX protein, e.f 5 a protein that is different from the AMFX protein and that is derived from 

25 the same or a different organism. Within an AMFX fusion protein the AMFX polypeptide can 
correspond to all or a portion of an AMFX protein. In one embodiment, an AMFX fusion 
protein comprises at least one biologically-active portion of an AMFX protein. In another 
embodiment, an AMFX fusion protein comprises at least two biologically-active portions of an 
AMFX protein. In yet another embodiment, an AMFX fusion protein comprises at least three 

30 biologically-active portions of an AMFX protein. Within the fusion protein, the term 

"operatively-linked^cis intended to indicate that the AMFX polypeptide and the non-AMFX 
polypeptide are fused in-frame with one another. The non-AMFX polypeptide can be fused to 
the N-terminus or C-terminus of the AMFX polypeptide. 
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In one embodiment, the fusion protein is a GST-AMFX fusion protein in which the 
AMFX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 
sequences. Such fusion proteins can facilitate the purification of recombinant AMFX 
polypeptides. 

5 In mother embodiment, the fusion protein is ah AMFX protein containing a 

heterologous signal sequence at its N-tenninus: In certain host cells (e.g. , mammalian host 
cells), expression and/or secretion ^f AMFX can be increased through use of a heterologous 
signaT sequence; -'- :l " - r 'i-r^ :..-*. — >■■. ■ • 

In yet smother embodiment, the' fusion; protein is an AMFX-immuhoglobulin fusion 

10 protein in which the AMFX sequences are fused to sequences derived from a member of the 
immunoglobulin pfotem fahiily : ; The AMFX'-immunoglobulin fusion proteins of the invention 
can be incorporated into phaimaceutical cbm to a subject to inhibit 

an interaction between ah^AMFX ligahd arid ah AMFX protein on the surface of a ceil,vto' 
thereby suppress AMFX-mediated signal frahstduetibri m vivfe 1 !^ 

i 5 fusion proteins can be used to affect the bioavailability ofib AMFX cognate ligahd; Inhibition 
of the AMFX ligand/AMFX interaction may be useful therapeuticairy for both the treatment of 
proliferative and differehtiative disorders^ as Well -£s modulating 1 (eg: promoting or inhibiting) 
ceirsurvival. Moreover; the AMFX-iiiunimoglobulm fusion proteiris-of the invention can be 
used as immunogens to produce anti-AMFX antibodies in a subject, to purify AMFX ligands, 

20 and in screening assays to identify molecules that inhibitthe interaction of AMFX with an 
AMFX ligahd: ^ - — " — ---h - - " ---- • : ~- - ; ■ 

An AMFX chimeric or fusion protein of the invention can be produced by standard 
recombinant DN A techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 

25 techmqiiesre.g:, by employing blunt-ended or stagger-ended termini for ligation, restriction 
. enzyme digestion to provide for appropriate termini, fUling-iti of cohesive ends as appropriate, 
alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers: Alternatively, PCK amplification of gene fragments can be 

30 carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see; e.g., Ausubel, et al. (eds.) Current Protocols in Molecular 
Biology, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). An AMFX-encoding 
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nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the AMFX protein. 

AMFX Agonists and Antagonists 

The invention also pertains to variants of the AMFX proteins that function as either 
5 AMFX agonists (/. e. t mimetics) or as AMFX antagonists. Variants of the AMFX protein can 
be generated by mutagenesis {e.g., discrete point mutation or truncation of the AMFX protein). 
An agonist of the AMFX protein can retain substantially the same, or a subset of, the biological 
activities of the naturally occurring form of the AMFX protein. An antagonist of the AMFX 
protein can inhibit one or more of the activities of the naturally occurring form of the AMFX 

10 protein by, for example, competitively binding to a downstream or upstream member of a 

cellular signaling cascade which includes the AMFX protein. Thus, specific biological effects 
can be elicited by treatment with a variant of limited function. In one embodiment, treatment 
of a subject with a variant having a subset of the biological activities of the naturally occurring 
form of the protein has fewer side effects in a subject relative to treatment with the naturally 

1 5 occurring form of the AMFX proteins. 

Variants of the AMFX proteins that function as either AMFX agonists mimetics) 
or as AMFX antagonists can be identified by screening combinatorial libraries of mutants {e.g., 
truncation mutants) of the AMFX proteins for AMFX protein agonist or antagonist activity. In 
one embodiment, a variegated library of AMFX variants is generated by combinatorial 

20 mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated 
library of AMFX variants can be produced by, for example, enzymatically ligating a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential AMFX 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
proteins {e.g., for phage display) containing the set of AMFX sequences therein. There are a 

25 variety of methods which can be used to produce libraries of potential AMFX variants from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can 
be performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential AMFX sequences. 

30 Methods for synthesizing degenerate oligonucleotides are well-known within the art. See, e.g., 
Narang, 1983. Tetrahedron 39: 3; Itakura, etal., 1984. Annu. Rev. Biochem. 53: 323; Itakura, 
et al. 7 1 984. Science 1 98: 1 056; Ike, et al, 1 983 . Nucl Acids Res. 1 1 : 477. 
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Polypeptide Libraries 

In addition* libraries of fragments of the AMFX protein coding sequences can be used 
to generate a variegated population of AMFX fragments for screening and subsequent selection 
of variants of an AMFX protein. In one embodiment, a library of coding sequence fragments 
can-be generated by treating a double stranded PCR fragment of an AMFX coding sequence 
with a nuclease under conditions wherein nicking occurs only about once per molecule, 
denaturing the double stranded DNA, renaturing the: DNA to form double-stranded DNA that 
can include sense/antisense pairs from different nicked products, removing single stranded 
portions from reformed duplexes by treatment with S\ nuclease^ and ligating the resulting 
fragment library into an expression vectOT. By this method* expression libraries can be derived 
which encodes N-tenninal and internal fragments of various sizes of the AMFX proteins. 

; : Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation^ and for screening cDNA" libraries for gene;, 
products having a selected property. Such techniques are adaptable for rapid screening of the 
gene libraries generated by tte combinatorial m^ The most 

widely used techniques* whicfr are: amenable to high throughput analysis, for screening large 
gene libraries typically i include cloning the gene library into replicable expression vectors, 
transfonning appropriate cells with the resulting library of vectors^ and expressing the - ; 
combinatorial genes under conditions in which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product was detected. Recursive ensemble 
mutagenesis (REM)^a new technique that enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify AMFX variants. 
See, e.g., Arkin and Yourvan, 1992. Proc} Natl, Acadt Set USA 89: 781 1-7815; Delgrave, et 
al;; 1993, Protein Engineering 6:327-331; L . 

Anti-AMFX Antibodies 

_The invention encompasses antibodies and antibody fragments, such as F a b or (F a b)2, 

that bind immunospecifically to any of the AMFX polypeptides of said invention. 

An isolated AMFX protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind to AMFX polypeptides using standard techniques 
for polyclonal and monoclonal- antibody preparation. The full-length AMFX proteins can be 
used or, alternatively, the invention provides antigenic peptide fragments of AMFX proteins 
for use as immunogens. The antigenic AMFX peptides comprises at least 4 amino acid 
residues of the amino acid sequence shown in SEQ ID NO NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 
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and 20 s and encompasses an epitope of AMFX such that an antibody raised against the peptide 
forms a specific immune complex with AMFX. Preferably, the antigenic peptide comprises at 
least 6, 8, 10, 15, 20, or 30 amino acid residues. Longer antigenic peptides are sometimes 
preferable over shorter antigenic peptides, depending on use and according to methods well 
5 known to someone skilled in the art. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of AMFX that is located on the surface of the protein {e.g. , a 
hydrophilic region). As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity. and hydrophobicity may be/generated by any method well known in 

10 the art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or 
without Fourier transformation (see, e.g., Hopp and Woods, 1981. Proa Nat. Acad Set USA 
78: 3824-3828; Kyte and Doolittle, 1982. J.MoL Biol. 157: 105-142, each incorporated herein 
by reference in their entirety). 

As disclosed herein, AMFX protein sequences of SEQ ID NOS.2, 4, 6, 8, 10, 12, 14, 

15 1 6, 1 8 and 20, or derivatives, fragments, analogs or homologs thereof, may be utilized as 
immunogens in the generation of antibodies that immunospecifically-bind these protein 
components. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologieally-active portions of immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site that specifically-binds (immunoreacts with) an antigen, such as AMFX. 

20 Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 
Fab and F( a b*)2 fragments, and an F ab expression library . In a specific embodiment, antibodies to 
human AMFX proteins are disclosed. Various procedures known within the art may be used 
for the production of polyclonal or monoclonal antibodies to an AMFX protein sequence of 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8 and 20, or a derivative, fragment, analog or homolog 

25 thereof. Some of these proteins are discussed below. 

Also included in the invention are antibodies to AMFX proteins, or fragments of 
AMFX proteins. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen binding site that specifically binds (immunoreacts with) an antigen. Such 

30 antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F a b, 
F a b» and F(ab')2 fragments, and an F^ expression library. In general, an antibody molecule 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 
from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as IgGi, IgG 2 , and others. Furthermore, in humans, the light . 
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chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 

An isolated AMFX-related protein of the invention may be intended to serve as an 
antigen, or a portion or fragment thereof, and additionally can be used as an immunogen to 
generate antibodies that immunospecifically bind the antigen, using standard techniques for 
polyclonal and monoclonal antibody preparation; The fiUl-length~protein can be used or, 
alternatively , the invention provides antigenic peptide fragments of the antigen for use as 
immunogens. An antigenic peptide, fragment comprises atleast-6 aminoacid residues of the 
amino acid sequence of the full length protein and encompasses an epitope thereof such that an 
antibody raised against: the peptide foims a specific iinmune complex with the full length 
protein or with any fragment that contains the epitopes Preferablyvthe antigenic peptide, 
comprises at least: lO amino acid residues; orut leasfc 15 aminaackr residues, or at least 20 
amino acid residues* or at least JQamihaacidvresidre^ by the 

antigenic peptide are regions- rftterprotein that art located oirits.wx&LCe; commonly these are 
hydrophilic regions. t -ui : . /. -V. -c;' -V^" 

In certain embodiments of the invention; at least on&epitope encompassed by the 
antigenic peptide is a region of AMFX-related protein thafcis located on the surface of the 
protein, e.g., a hydrophilic region. A hydrophobicity analysis of the human AMFX-related 
protein sequence will indicate which regions of a AMFX-related protein are particularly 
hydrophilic and, therefore; -areilikely to encode surface residues useful for targeting antibody 
production. As a means for targeting antibody production, hydropathy plots showing regions of 
hydrophilicity and hydrophobicity may be generated by any method well known in the art, 
including, for example, the Kyte Ddolittle or the Hopp Woods methods; either with or without 
Fourier transformation. See, ie g. y Hopp and Woods, 1 9%\~?roe:Ntit. Acad Set USA 78: 3824- 
3828; Kyte and Doolittle 1982, J- Mai. Biol. 157: 105-142, each of which is incorporated 
herein by reference in its entirety; Antibodies that are specific for one or more domains within 
an antigenic protein^ or derivatives, fragments, analogs or hbmblogs thereof, are also provided 
herein.' 1 " " ■ - -■ - ■ - ■:• 

A protein of the invention, or a derivative, fra^nent, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. - 

Various procedures known within the art may be used for the production of polyclonal 
or monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
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Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 
Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include, but are.not limited to keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further 
include an adjuvant Various adjuvants used to increase the immunological response include, 
but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum 
hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, 
oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille 
Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents. 
Additional examples of adjuvants which can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known . 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson 
(The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 
2000), pp. 25-28). 
Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one.molecular 
species of antibody molecule consisting of a unique light chain gene product and a.unique 
heavy chain gene product In particular, the complementarity determining regions (CDRs) of 
the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
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contain an antigen binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent; Alternatively, the lymphocytes can be immunized in vitro: 
- r yr Hie immunizing agent will typically include the protein antigen- a fragment thereof or a 
fusion protein thereof; Generally, either peripheral blood lymphocytes are used if cells of 
human origin are d^iired; 6t sirfeert celts ot lymph node cells are used if non-human 
mammalian sources are desired:' The lymphocytes arethen fused with an immortalized cell 
line using a suitable fusing agent,- such 5 as polyethylene glycol^ to form a hybridoma cell 
(Goding. Monoclonal Antibodies: Pririci^es and Practice- Academic Piress, (1 986>pp. 59- 
103). Immortalize* celt lines ate usually ttansifbrmed rhMimaliari cells; particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells can be cultured in a suitable culture medium that preferably 
contains one or more substances that inhibit the growth or Survival of the unfiised; 
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hyjK)xmtfaiiie, amiiH>pt&iri, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-defitient cells. 

Preferred immortalized cell Imesare those that fuse efl 
level expression of antibody by the selected ahtibbdy-prodiicing cells, and are sensitive to a 
medium such as HAT medium: -More preferred immortalized" cell lines are murine myeloma 
lines, which can be obtained; for instance, from the Salk Institute Cell Distribution Center, San 
Diego, California and the American Type Culture Collection, Manassas, Virginia. Human 
myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies (Kozbor. J. Immunol, 133 :3001 (1984); Brodeur 
et al., Monoclonal Antibody Production Techniques and Applications. Marcel Dekker, Inc., 
New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assays such as radioimmunoassay (RIA) or 
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enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 
the Scatchard analysis of Munson and Pollard, Anal. Biochem. . 107:220 (1 980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target antigen 
5 are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods. Suitable culture media for this 
purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

10 The monoclonal antibodies secreted by the subclones can be isolated or purified from 

the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 

15 those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 

20 then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the 
synthesis of monoclonal antibodies in the recombinant host cells. The DNA also can be 
modified, for example, by substituting the coding sequence for human heavy and light chain 
constant domains in place of the homologous murine sequences (U.S. Patent No. 4,8 16,567; 

25 Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the immunoglobulin coding 
sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a 
non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of 
the invention, or can be substituted for the variable domains of one antigen-combining site of 
an antibody of the invention to create a chimeric bivalent antibody. 

30 Humanized Antibodies 

Th£ antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against the 
administered immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
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immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
5 Nature,: 321:522-525 (1986); Riechmann et -at ■ Nature, 332:323-327 (1 988); Verhoeyen et at, 
Science, 239: 1 534-1-536 (1988)), by substituting rodent CDKs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immtmo^obiilki are replaced by corresponding 
non-human residues^ Humanized 'antibodies can also <2^mprise residues which are found 
10 neitherin the recipient antibody nor M ^ th^imported GDJfcor fi^ew^ se^eiices : r In general, 
the humanized antibody wil t 

variable domains, in which all of substantially all of the CDR regions ^o^n^spondtbtiibse of a 
non-human immunoglobulin- afid att or substantially allSf the^fra»6Work regions are those of a 
human immunoglobulin consensus sequence. The humanized ai^bod^ d^timally also will 
15 comprise at least a portion of art immunoglobulin constant regton (Fc>, tj^pically that of a 
human immunoglobulin (Jraes et •■■01.* 1986pRiechmantte?t aky 198^; arid Presto Gurt; Op. 
Struct. Biol.. 2:593-596 (1992)), 

Human Antibodiea ~- \^.-*^yc.^~ ,;r * ^ - >--<i--- 

FuUy human antibodies relate ta ^ 

20 sequences of both Ae: light chain and the heavy chains including the GDRsf arise from hianan 
genes. Such antibodies are termed "human antibodies'*; <>r "ftilly human antibodies^ herein. 
Human monoclonal antibodies can be prepared by the fribma te^hhiqtiel^the^hirinan B-cell 
hybridoma technique (seeKoztor^et al.; 1983 Immunol Today 4v 72) 2nd the EBV hybridoma 
technique to produce human monoclonal antibodies (see; Cole, et at, 1985 In: Monoclonal 

25 Antibodies and Cancer Therapy, Alan R. Liss, Inc;v pp* 77-96). Human monoclonal 

antibodies may be utilized in the practice of the present invention and may be produced by 
using human hybridomas (see Cote> et aLy 1983v Pfoc Natl Acad Sci USA 80: 2026-2030) or 
by transforming human B-cells with Epstein Barr Virus in vitro (see Cole? et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan TL Liss, Incj, pp. 77-96); 

30 In addition; human antibodies can also be produced using additional techniques, 

including phage display libraries (Hoogenboom and Winter: J. Moh Biol.. 227:381 (1991); 
Marks et al., J. Mol. BioL. 222:58 1 (1 991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g. , mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated: Upon 
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challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 
5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); 
5 Lonberg et al. ( Nature 368 856-859 (1994)); Morrison ( Nature 368 . 812-13 (1994)); Fishwild 
et aU Nature Biotechnology 14. 845-51 (1996)); Neuberger fNature Biotechnology 14. 826 
( 1 996)); and Lonberg and Huszar ( Intern. Rev. Immunol. 13 65-93 (1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's 

10 endogenous antibodies in response to challenge by an antigen. (See PCT publication 

WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains in 
the nonhuman host have been incapacitated, and active loci encoding human heavy and light 
chain immunoglobulins are inserted into the host's genome; The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite human 

1 5 DNA segments. An animal which provides all the desired modifications is then obtained as 
progeny by crossbreeding intermediate transgenic animals containing fewer than the full 
complement of the modifications. The preferred embodiment of such a nonhuman animal is a 
mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and 
WO 96/34096. This animal produces B cells which secrete fully human immunoglobulins. 

20 The antibodies can be obtained directly from the animal after immunization with an 

immunogen of interest, as, for example, a preparation of a polyclonal antibody, or alternatively 
from immortalized B cells derived from the animal, such as hybridomas producing monoclonal 
antibodies. Additionally, the genes encoding the immunoglobulins with human variable 
regions can be recovered and expressed to obtain the antibodies directly, or can be further 

25 modified to obtain analogs of antibodies such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent 
No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at 
least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of 

30 the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain 
locus, the deletion being effected by a targeting vector containing a gene encoding a selectable 
marker; and producing from the embryonic stem cell a transgenic mouse whose somatic and 
germ cells contain the gene encoding the selectable marker. 
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A method for producing an antibody of interest, such as a human antibody, is disclosed 
in U.S. Patent No. 5,916,771 . It includes introducing an expression vector that contains a 
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, 
introducing an expression, vector containing a nucleotide sequence encoding a light chain into 
another mammalian host celly and fusing the two cells to form a hybrid celL The hybrid cell 
expresses an antibcniy containing the heavy chain and the light chain. 

In a furtherimprovement on this procedure, a method for identifying a clinically 
relevantepitope on am immunogen* and ^correlative method for selecting- an antibody that 
binds immunospecifically to thePrelevarit epitope with high affinity , are disclosed m PCT 
pubKcation.WO J 99/5:3G49tH£ ;M0Uv:£ E:^bkvu, C*Ad p. ii^H nb r : ; — 
Fib Fragments, and SinglfcChaiitAntibodies iW- r.r-.: ..u-v. - r,i*ji d: 

, , v; Acc»rdii^ to the invent production of : . 

single^chain antibodies specific to an antigenic protein of the invention (see e^gi, U.S. Patent: 
Noi 4*946^778). ^In addition/ melhodsjcanbeadapted^ of Fab expression 

libraries;(see e;g^ Huse, elat ; J989 Scienc»24d: 12354128i>>tD : . allow rapid and effective^ 
identification of monoclonal fragments with the desired specificity for a protein or 
derivatives; fragments, analogs or homologs thereo£ Antibody fragments that contain the 
idiotypes to a ; protein antigen may bejn-oducedby techniques knownun the art including, but 
not limited tou(i) an F^t* fragment produced by pepsin digestion of an antibody molecule; (ii) 
an Fab fragment generated by reducing the. disulfide bridges, of an F( ab -)2 fragment; (Hi) an F ab 
figment generated by the treatment of the antibodyr molecule with papain and a reducing agent 

and(iv)F v fragnKI3l3.Ai -I X = jhu ^-::.'::::;^.;.". ^ • ::: 

Bispecific Antibodies- : :;u;:;.:u :i; vr-^-iu-u..;^ -m- -ria _ -■ 

v Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens.: In the present case, one of the 
binding specificities, is for an antigenic protein of the invention. The second binding target is 
any other: antigen, and advantageously is a ceU-surface>protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is basedon the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and GueUo^Nature, 305:537.539 (1 933)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce 
a potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by 
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affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 
13 May 1993, and in Traunecker et al, 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
5 preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
" (CHI) containing the site necessary for light-chain binding present in at least one of the 
fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
1 0 transfected into a suitable host organism. For further details of generating bispecific antibodies 
see, for example, Suresh et al., Methods in Enzvmologv . 121:210 (1986). 

According to another approach described in WO 96/270 1 1 , the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which 
are recovered from recombinant cell culture. The preferred interface comprises at least a part 
15 of the CH3 region of an antibody constant domain. In this method, one or more small amino 
acid side chains from the interface of the first antibody molecule are replaced with larger side 
chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the 
large side chain(s) are created on the interface of the second antibody molecule by replacing 
large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a 
20 mechanism for increasing the yield of the heterodimer over other unwanted end-products such 
as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments 
(e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from 
antibody fragments have been described in the literature. For example, bispecific antibodies 

25 can be prepared using chemical linkage. Brennan et al., Science 229:8 1 (1985) describe a 

procedure wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 fragments. 
These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab 5 fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 

30 derivatives is then reconverted to. the Fab' -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 
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Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) ^ 
describe the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab' 
fragment was separately secreted from E. coli and subjected to directed chemical coupling in 
5 vitro to form the bispecific antibody; The bispecific antibody thus formed was able to bind to 
cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets; 

Various techniques for making and isolating bispecific antibody fragments directly 
from recombinant cell culture have also been described. Foe example, bispecific antibodies 

1 0 have been produced-using ■ leucine zippers: 7 KdStelny et ahy -J: Immunol. 1 48(5): 1 547-1 553 
(1992).. The lieuciire zipper peptides from the Fx>s and Jun proteins were linked to the Fab' 
portions of two different antibodies hy gene fusion; Th^antibody homodimers were reduced at 
the hinge: region to form monomers and then re^oxidized to forfo the totibody hetef bdime^v 
This method can also be utilized for the production Of antibody hotockhmers- The ^diabody" 

1 5 technology described by HolKnger et al., Proa Natl; Acad; Sck USA 90:6444^-644^(1 993) has 
provided an alternative mechanism fo* making bispecific antibody fragments. The;fragmettts 
comprise a heavy-chain variable domain (Vh) connected to; a light-chain variable domain (V L ) 
by a linker which is to& short to allow pairing between the two domains on the same chain. 
Accordingly^ the and Vl domains of one fragment are forced to pair with the ; :■ 

20 . complementary Vl and Vm domains of another fragment, thereby forming two antigen-binding . 
sites. Anothra strategy for making bispecific antibody fragments by the use of single-chain Fv 
(sFv) dimers has also be^ reported. See, Gruber et al„ J. Immunol: 152:5368 f 1994). 

Antibodies with more than two valencies: are* contemplated; For example, trispecific 
antibodies canbe prepared; Tutt et alu J. Immunol. 147:60 (19911 

25 r - Exemplary bispecific antibodies can bind to two different epitopes, at least one of 

which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of 
an immunoglobulin molecule cam be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or 
Fc receptors for IgG (Fc R), such asFc RI (CD64) s Fc RII (CD32) andFc RIH (CD16) so as 

30 to focus cellular defense mechanisms to the cell expressing the particular antigen. Bispecific 
antibodies can also be used to direct cytotoxic agents to cells which express a particular 
antigen^- These antibodies possess an antigen-binding arm and an arm which binds a cytotoxic 
agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another 
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bispecific antibody of interest binds the protein antigen described herein and further binds 
tissue factor (TF). 
Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 
03089). It is contemplated that the antibodies can be prepared in vitro using known methods in 
synthetic protein chemistry, including those involving crosslinking agents. For example, 
immunotoxins can be constructed using a disulfide exchange reaction or by forming a thioether 
bond. Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 
Effector Function Engineering 

- It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region^ thereby allowing interchain 
disulfide bond formation in this region. The homodimeric antibody thus generated can have 
improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron et aL, J. Exp Med., 176: 1191- 
1 195 (1992) and Shopes, J. Immunol, 148: 2918-2922 (1992), Homodimeric antibodies with 
enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as 
described in Wolff et al Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody 
can be engineered that has dual Fc regions and can thereby have enhanced complement lysis 
and ADCC capabilities. See Stevenson et al, Anti-Cancer Drug Design, 3: 219-230 (1989). 
Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to 
a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin 
of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., 
a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
described above. Enrymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 

79 



WO 01/74897 



PCT/US01/10892 



PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 

^BU^J^In^and^Re. 

5 Conjugates.of the antibody and cytotoxic agent are ^^^made using a variety of bifunctional 

protein-qoupling agents such as N-succinimidyl-3-(2-pyridyldithioI) propionate (SPDP), 
iminothiolane (IT), bifimctipnalr d^yative^of kmdoe^ter^ (such as dimethyl adipimidate 
. HCL), active esters (such: as .disucc^midyi suba^te^ ialdehydles r (sueh as glutareldehyde), bis- 
azido compounds (such as bis (p-gzidobenzoy 1) hexanediamine), bis-diazonium derivatives 

1 0 (such as bis r (F^iazQ^TOb^^ 2,6- 
diispcyanate)^ and . jtu^apgye ftupriflft Wjnp9U^»(OT^b §^,5 T di£toQ^ 
For example, a ripin.. in^iui^^c^n be ; pi^^^^4escnbedJ^ Science, %38: 

1098 09?;ft-^ iv> idi : ; 

triamiMpentaacetic Mid;^ 

1 5 radiqnucleotide. to the antibody, : . See ? , PublicaliQii ^G9£/U€2& c : > T :r . : 
, In another; emtoj^^t^t^ a "receptor" (such 

streptavidin) for utUi^tiqi^ in tymoi: pretargetipg; >yh§^v.di(B antibody-receptor conjugate is 
administered to the p^etnVfoUQwe4 Ivyjremqval of- unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g. , avidin) that is in turn 

20 conjugated to a cytotQjcici^geirf rl . v ^ . : ,,,, ? ^,-, JiT>; -!,. . riT ^ r .., rS: i v , r w :V , rr _ r ^ > v „ 

AMFX Recombinant Expression Vectors and Host Cells 

Anqthe^ aspect of the invention pertains^ vectors, preferably expression vectors, 

containing a nucleic, apid ^ncqding 7 ^n AMFX protein^ or derivatives, ^fragments, analogs or 
homologs foereqf. As. used herein, the term "vector" refers, to a nucleic acid molecule capable 

25 of transporting another nucleic acid to. which it has been linked. One type of vector is a 

"plasmid", which refers. to a circular double stranded DNA loop into which additional DNA 
segments can be ligated. Another type of vector is a viral vector, wherein additional DNA 
segments can be ligated into the viral genome.„ Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced (e.g., bacterial vectors having a 

30 bacterial origin of replication and episomal mammalian vectors). Other vectors (e.^., 
non-episomal mammalian vectors) are integrated into .the genome of a host cell upon 
introduction into the host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are 
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operatively-linked. Such vectors are referred to herein as "expression vectors". In general, 
expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. 
In the present specification, "plasmid" and "vector" can be used interchangeably as the plasmid 
is the most commonly used form of vector. However, the invention is intended to include such 
5 other forms of expression vectors, such as viral vectors (e.g. , replication defective retroviruses, 
adenoviruses and adeno-associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that 
the recombinant expression vectors include one or more regulatory sequences, selected on the 

1 0 basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid 
sequence to be expressed. Within arecombinant expression vector, "operably-linked" is 
intended to mean that the nucleotide sequence of interest is linked to the regulatory sequence(s) 
in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro 
transcription/translation system or in a host cell when the vector is introduced into the host 

15 cell). . ; : 

The term "regulatory sequence" is intended to includes promoters, enhancers and other 
expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
20 those that direct constitutive expression of a nucleotide sequence in many types of host cell and 
those that direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
25 invention can be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g. , AMFX proteins, 
mutant forms of AMFX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
AMFX proteins in prokaryotic or eukaryotic cells. For example, AMFX proteins can be 
* : ' ® x P ressec * in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 

vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 
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Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either fusion 
or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors 
5 typically, serve three purposes: (z) to increase expression of recombinant protein; (ii) to increase 
the solubility of the recombinant protein; and (m) to aid in the purification of the recombinant 
protein by acting as a ligand in affinity purification.,, Often* in fusion expression vectors, a 
proteolytic cleavage site is in^pducied £t ther junction of the fusion moiety and the recombinant 
protein to enable separation of therecombinant protein. frpm-thefusion moiety: subsequent to 
1 Of punficatipn of the fusion protein. Such enzymes* and th^ir cognate, recognition sequences,. 
include Factor X^tfhipinbiiva^ include 
pGEX (Fharjraci^Bio^^ $n4 Johiison^i 988. Gme 61: 31 -40), pMAL; (New 

England Biolabs, T Beverly , Mass,) and ■ pRJT5 (Phann^ci^, Pi^ca^way, N. J.) that fuse ; - . , 
glutathione S-transferase (GST), maltoseJE binding T protem, ox. protein A, respectively, to the 

15 toget raombinant prptein. i^-^v r:?r:-o..;: ::::-J - :»h?Kv.: ; i!:r;:v — -v- ; : : v:*-:^-.- 
Examples of suitable inducible non-fusion IL coli expression vectors include pTrc 
(Amrann et al, y (1 988) [ Gene 69:301^31 5) ai^d pEX 114 (SUidier et aL 9 Gene Expression . 
Technology; MErrao^ Press, San Diego, Calif. (1990) 

60-89).. ■ r:il . ::;,;:„;--- ... , ;: .... , \ .,. - 

20 One strategy to maximize recombinant protein ^ 

protein in a host b^teria.\vith an impair^, capacity to, proteolytically cleave, the recombinant 
protein. - See, e.g., Gottesm^ G^E Expression Technology: Methods in Etymology 
185, Academic Pt^jSm Another strategy is to alter the 

nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the 

25 individual cpdons for ea<?h amino add are- those preferentially utilized in K coli (see, e.g., 

Wada, : ef al., 1992. Nucl Acids Res. 20: 211-1-2118).- Such alteration of nucleic acid sequences 
of the invention can: be carried out by standard DNA synthesis techniques. 

In another embodiment, the AMFX expression vector is a yeast expression vector. 
Examples of vectors for expression in yoastSaccharomycescerivisae include pYepSecl 

30 (Baldari, et a/.^ 1987. EMBO J. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30: 
933-943), pJRY88 (Schnlteera/., 1987. Gene 54: 113-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 

Alternatively, AMFX can be expressed in insect cells using baculovirus expression 
vectors. Baqulovirus vectors available for expression of proteins in cultured insect cells (e.g., 
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SF9 cells) include the pAc series (Smith, et al 9 1983. Mol. Cell Biol. 3: 2156-2165) and the 
pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors 
5 include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et aL, 1987. EMBO J. 
6: 187-195). When used in mammalian cells, the expression vector's control functions are 
often provided by viral regulatory elements. For example, commonly used promoters are 
derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of 

10 Sambrook, et ah, Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, the recombinant mammalian expression vector is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 

15 regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; Pinkert, et aL, 1987. Genes Dev. 1 : 
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol 43: 235-275), 
in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J. 8: 729-733) 
and immunoglobulins (Banerji, et al, 1983. Cell 33: 729-740; Queen and Baltimore, 1983. 

20 Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and 
Ruddle^ 1989. Prpc. Natl Acad. Sci. USA 86: 5473-5477), pancreas-specific promoters 
(Edlund, et al. 9 1985. Science 230: 912-916), and mammary gland-specific promoters (e.g., 
milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 
264,166). Developmentally-regulated promoters are also encompassed, e.g., the murine hox 

25 promoters (Kessel and Gruss, 1990. Science 249: 374-379) and the a-fetoprotein promoter 
(Campes and Tilghman, 1989. Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That is, 
the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows for 

30 expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 
AMFX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 
antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression 
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of antisense RNA. The antisense expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined by the 
cell type into which the vector is introduced. For a discussion of the regulation of gene 
5 expression using antisense genes see, e.g. x , Weintraub, et aL, , "Antisense RNA^as a molecular 
tool for genetic analysis," Reviews-Trends in Genetics, Vol. lQ.^ 19S6. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of theJnventiqn^ 
"recombm^t host ceU^ ^ 

10 not only tathe particular subject cell .but also to the progeny or v potential proge^pf such a 
cell. Because certain modifications may occur in 5uc^ee either- ; , 

mutation or environment^ infl^nce^ fecjj be ^id^T&^tprthe parent 

cell, but are still included wtlyn f th^ - RVi ^ ; : : -" \. 

A ; host cell can be any prokaiy olic or eul^^ AMEX protein can 

15 be expressed in; bacterid ^eUs sudtr ^s R cqli^ insect cells, yeast cm: mammalian cells (such as 
Chinese hamstoove^ cells)^ Other swtebte^^pdils are ^ known to those 

skilled inthe art r - , _ _ : - -y: : M:h^ rc^.r^r.^, ior c;u;a;K-, — rviv'^j. ■'«:«■ ^^ r r:-:.:-A,:.\ 
.yectcwr.DNA can be introduced into prokaryotic or eukary otic cells via conventional 
transformation or transfectipn tep^quesr* As used herein, tiie ^ terms "txansfoimation" and , 

20 "trarafection" are intended to refer to, a yarietYro£art-r^ . 
foreign nucleic acicL(e.g: , DNA) into, a ho?t cell* including calcium phosphate or calcium 
cMoride co-precipitatioiv„^ 

Suitable methods ^ fop tr^fornung or transfecting host cells xan.be; found in Sambrppk, etal. 
(Molecular Cloning; A . La^o^tqry Manual. 2nded., Cold Spring^Harbpr Laboratory, 

25 Cold Spring Harbor Laboratory Press, Cod&Spring Harbor, N. Y., 1 989), and other laboratory 
manuals.;-. • - * .. ? . is ._.--. - ._- - i ■ 

For stable transfection of mammalian cells, it is known that^ depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreignDNA inta their genome. In order to identify and select these integrants, a gene that 

30 encodes a selectable marker {e.g. 9S resistance to antibiotics) is generally introduced into the host 
cells along with the gene of interest Various selectable markers include those that confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding AMFX 
or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic 
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acid can be identified by drug selection {e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
be used to produce (i.e., express) AMFX protein. Accordingly, the invention further provides 
5 methods for producing AMFX protein using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding AMFX protein has been introduced) in a suitable 
medium such that AMFX protein is produced. In another embodiment, the method further 
comprises isolating AMFX protein from the medium or die host cell. 

1 0 Transgenic AMFX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or 
an embryonic stem cell into which AMFX protein-coding sequences have been introduced. 
Such host cells can then be used to create non-human transgenic animals in which exogenous 

1 5 AMFX sequences have been introduced into their genome or homologous recombinant animals 
in which endogenous AMFX sequences have been altered. Such animals are useful for 
studying the function and/or activity of AMFX protein and for identifying and/or evaluating 
modulators of AMFX protein activity. As used herein, a "transgenic animal" is a non-human 
animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or .■ 

20 more of the cells of the animal includes a transgene. Other examples of transgenic animals 

include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene 
is exogenous DNA that is integrated into the genome of a cell from which a transgenic animal 
develops and that remains in the genome of the mature animal, thereby directing the expression 
of an encoded gene product in one or more cell types or tissues of the transgenic animal. As 

25 used herein, a "homologous recombinant animal" is a non-human animal, preferably a 

mammal, more preferably a mouse, in which an endogenous AMFX gene has been altered by 
homologous recombination between the endogenous gene and an exogenous DNA molecule 
introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to development 
of the animal. 

30 A transgenic animal of the invention can be created by introducing AMFX-encoding 

nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
human AMFX cDNA sequences of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19, can be 
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introduced as a transgene into the genome of a non-human animal. Alternatively, a non-human 
homologue of the human AMFX gene, such as a mouse AMFX gene, can be isolated based on 
hybridization to the human AMFX cDNA (described further supra) and used as a transgene. 
Intronic sequences and pplyadenylation signals can also be included in the transgene to 
5 increase, the efficiency. of.expre^ion.of the transgsne.,.A tissuerspecific regulatory sequence(s) * 
can be operably-linked to the ; AMFX transgene to direct expression of AMFX protein to 
particular cells. : Methods for g^^ animals via embryo manipulation and 

microinjection^ conventional in the art and are 

described, forexample, m U^ A patent Kos, .4^73 6>8 : 66; 4,870,009; . ^ : 4,873,t91; and Hogan, 

10. 1 986, Im l^MPULAT^^ LabOTatory.PresSi;Cold 
Spring Harbor,,NrY. ; Similar method^ eure^used .fofprodu^ animals.- A 

transgenic foinxdeac, anim^ be identified ba^ upp^ ; ^e presepse^ of ,^e AMEX transgene in 
its genomaan$l£^ of AM^ geUs ,o£ the; animal?. u A transgenic ^ 

foi^der £ttim additional animals ,9arrymg the, transgene. -Moreover, 

1 5 transgenic anin^lscafr^ AMFX prpteip caafurther be bred to other 

traqggemq animals p^rryijog Qtto,tr^gene&~ v , :: , . ; , c ,, : T _ 

T To create a homologpu^ recombinant animal, a r yectoi is prepared which contains at , 
least ^portiqivof ^ t ^MF^ 
introduced to thereby; altCT^e.^^ 

20 b$ ahuman gwe,^,the ^MA-^EQ ; m NOS: U3,A'7, 9,. 11,. 13,. 1 5» : 1 7 : an4 1 9), but 
more preferably^ is a non-human hompl^^ of ahtunan ^ . For example, a mouse 

homologue jpfbuiw 13, 15, 17 and 19, can be 

used to constmc* a^hoi^ologousa^ suitable for altering an endogenous 

AMFX . gene r ii*.the c mouse genome*, to.one.embqdim^^ ^ vector is designed such that, upon 

25 homologous recombination, tbe endogenous AMFX gene is functionally disrupted .(/.«., no 
longersncodes. a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that^ upon homologous recombination, 

the endogenous. AMFX gene is mutated or otherwise altered but still encodes functional 
protein (e.g x , the upstream regulatory region.can be altered to thereby alter the expression of 

30 the endogenous AMFX.protein). In the homologous recombination vector, the altered portion 
of the AMFX gene is flanked at its 5'- and 3 Vtermini by additional nucleic acid of the AMFX 
gene, to allow for homologous recombination, to occur between the exogenous AMFX gene 
carried by the vector and an endogenous AMFX gene in an embryonic stem cell. The 
additional flanking AMFX nucleic acid is of sufficient length for successful homologous 
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recombination with the endogenous gene. Typically, several kilobases of flanking DNA (both 
at the 5- and S'-termini) are included in the vector. See, e.g., Thomas, et al. 9 1987. Cell 51 : 
503 for a description of homologous recombination vectors. The vector is ten introduced into 
an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced AMFX 
gene has homologously-recombined with the endogenous AMFX gene are selected. See, e.g., 
U,etaL, 1992. Ce//69:915. 

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to 
form aggregation chimeras. See, e.g. , Bradley, 1987. In: Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRJL, Oxford, pp. 1 1 3 - 1 52. 
A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal 
and the embryo brought to term. Progeny harboring the homologously-recombined DNA in 
their germ cells can be used to breed animals in which all cells of the animal contain the 
homologouslyrrecombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 
described further, in Bradley, 1 991 . Curr. Opvz Biotechnol 2: 823-829; PCT International 
Publication Nos;: WO 90/11354; WO 91/01140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage P 1 . For a description of the 
cre/loxP recombinase system, See f e.g., Lakso, et a!. 9 1992. Proc. Natl Acad Sci. USA 89: 
6232-6236. Another example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae. See, O'Gorman, et al 9 1991. Science 251:1351-1355. Ifacre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. Such 
animals can be provided through the construction of "double" transgenic animals, e.g.* by 
mating two transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et al. y 1997. Nature 385: 810-813. In brief, a 
cell (e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter Go phase. The quiescent cell can then be fused, e.g., through the use of 
electrical pulses, to an enucleated oocyte from an animal of the same species from which the 
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to 
morula or blastocyte and then transferred to pseudopregnant female foster animal. The 
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offspring borne of this female foster animal will be a clone of the animal from which the cell 
(e.g., the somatic cell) is isolated. 

Pharmaceutical Compositions 

■ , . The AMFX nucleio acid molecules, AMFX proteins* and anti-AMFX antibodies (also 
5 referred to herein as rr actiye compounds") of the invention, and derivatives, fragments, analogs 
and hpmolpgs thereof, can be j^corpprated ; into phamaTOutical compositions suitable for 
administratioa -.. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceuticallyv acceptable carrie*. As ^$e$l Jiereinc "pharmaceutical^ 
0 a^p^le.cam^ r ' : is:m^^ded to include any^and an.^vsnt5,,d^ < 
10 antibacterial and antifimgal agents i; isotom^^ tikG'*<. 
compatible with pfcacra^ewtie^ac^^ Suitable c^^^& escribed in the-most ? 

recent edition of Remi^ 

which is inwiporate^^ of sucl^carriers^pF diluents, 

include, but r ; are ^ ^.limited to, water, saline, finger's solutions, dextrose soluti^ 

1 5 human semm^buxp^ ^dnonraquepus ,yehi<f fes, such as .fixed oils may also be 

used ; The use of such media and agents for phariDa^utically lactiye substances is well known 
in the art. Except insofar as : any : conventional media or agent is incompatible ^ with the active 
compound,, use the^ep£m Supplementary active compounds 

canalso bejncpiporated : ^ : n _-.r:u "-j: . - 

20 - A pharmaceutical^ with its 

intended route of administration. Examples; of routes of admimstration include parenteral, e.g. , 
intravenous, intradermal, su^ 

transmucosal, and rectaLadministration. Solutions or suspensions used for parenteral, 
intradermal, or subcu^^pm appUcation can include &e fpUowing cpmppneiits: a sterile 

25 diluent such as water for injection* saline solution, fixed oils* polyethylene glycols, glycerine, 
propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such 
as ethylenediaminetetraacetic acid (EDTA); : buffers such as : acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 

30 adjusted with acids or bases, such as hydrochloric acid or sodixim hydrpjdde. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
glass or plastic. . ■ 
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Phaimaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 
5 Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be 
sterile and should be fluid to the extent that easy syringeability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
. action of microorganisms such as bacteria and fungi. '-The carrier can be a solvent or dispersion 
medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene 

1 0 glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use-of a coating such as lecithin, by the 
maintenance of the required particle size in the -caSSbf dispersion arid by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobiitanol, phenol, ascorbic acid, thimerosal, and 

15 the like. In many cases, it will be preferable to include isotonic agents, for example, sugars, 
polyalcohols such as manitol, sorbitol, sodium chloride in the composition. Prolonged 
absorption of the injectable compositions can be brought about by including in the composition 
an agent which delays absorption; for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound (e.g., 

20 an AMFX protein or anti-AMFX antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 

25 solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
30 administration, the active compound can be incorporated with excipients and used in the form 
of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for 
use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished 
and expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and 
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the like can contain any of the following ingredients, or compounds of a similar nature: a 
binder such as microciystalline cellulose, gum tragacanth or gelatin; an excipient such as starch 
or lactose, a disintegrating agent such as alginic acid, PrimogeCor corn starch; a lubricant siich 
as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening 
5 agent such as sucrose of saccharin- or a flavoring aigent such a$ peppermint, methyl salicylate, 
or orange flavoring, - ! -~ ""- ^-^k^ r^i„,,\ 

For administration by inhalation, the : compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser * which* contains a suitable propellant, e.g. , a gas 
such as carb<bn- dioxid^^^ ii™* iiuhvid uair.. ^pa.cci : . :. 1 

10 y - : - 

traiismucosarf-ar £rafl^*»iatadm^ 

pbimeated^l^-u^-iBtte f6Hti&a&&i£NSu^^ : * 

include, for example, for trammucosat administratio»i detergent^ brie salfe, and ! fusidteacid o 
derivatives^ Traiismuco^&teu^ ofnasal spiay s 

1 5 or suppositories; Fog traiigdeimat adm M^atkin ^ the active kxtfnpctmds are formulated into 
ointments, salves, gels* orcreams ^^d£grtily'Jbid^wiiy-&0 -iad*tP - .. . - 

* : Theeompoimdseaa'^ ; 
conventional suppository bases such as cocoa butter and other glycerides) or retention enemas 
for rectal delivery ^qii£r±v?5 ^^nb-d '"-q;ti cert. ^ornc c-^roe. be- uzic:- - = i .-v ^ 

20 - In one eknbbdiment/the i^Ve ^ protect" 

the compomid agaii«;iap^^ 

formulation; including impltttSii^ systems; Biodegradable, 

biocompatible jk)lyi^^«^ber us^ suctf as^ ethylene ivinyl acetate^ pdlyanhydrides : 
polyglya)Uc^acad,^coll^OT, T polyor^de"SFt^, and polylactic^cick Methods for preparation of 
25 such formutetibnS:will be apparent to those skilled in the art "The materials can alsa be 

obtained cx)mm<erc^ Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral 
antigens) can alsa be used as pharmaceutically acceptable carriers^ These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. Patent 

^- Itis especially advantageous to formulate^oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used herein 
refers to physically discrete- units suited as unitary dosages for the subject to be treated; each 
unit containing a predetermined quantity of active compound calculated to produce the desired 
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therapeutic effect in association with the required pharmaceutical carrier. The specification for 
the dosage unit forms of the invention are dictated by and directly dependent on the unique 
characteristics of the active compound and the particular therapeutic effect to be achieved, and 
the limitations inherent in the art of compounding such an active compound for the treatment 
of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used as 
gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, 
intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) or by 
stereotactic injection (^ee/e.# , Chen, et ah, 1994. Proc. Natl Acad: ScL USA 91: 3054-3057). 
The pharmaceutical, preparation of the gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery 
vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced 
intact from recombinant cells, e.g. , retroviral vectors, the pharmaceutical preparation can 
include one or more cells that produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express AMFX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications), 
to detect AMFX mRNA (e.g. , in a biological sample) or a genetic lesion in an AMFX gene, 
and to modulate AMFX activity, as described further, below. In addition, the AMFX proteins 
can be used to screen drugs or compounds that modulate the AMFX protein activity or 
expression as well as to treat disorders characterized by insufficient or excessive production of 
AMFX protein or production of AMFX protein forms that have decreased or aberrant activity 
compared to AMFX wild-type protein (e.g; diabetes (regulates insulin release); obesity (binds 
and transport lipids); metabolic disturbances associated with obesity, the metabolic syndrome 
X as well as anorexia and wasting disorders associated with chronic diseases and various 
cancers, and infectious disease(possesses anti-microbial activity) and the various 
dyslipidemias. In addition, the anti-AMFX antibodies of the invention can be used to detect 
and isolate AMFX proteins and modulate AMFX activity. In yet a further aspect, the invention 
can be used in methods to influence appetite, absorption of nutrients and the disposition of 
metabolic substrates in both a positive and negative fashion. 
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The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 
Screening Assays 

The invention provides a method (also referred to herein as a "screening assay' 1 ) for 
5 identifying modulators^ i.e.', candidate or test compounds or agents {e.g. , peptides, 

peptidomimStics, small molecules or other drugs) that bind to AMFX proteins or have a 
stimulatory or inhibitory effect on, e. g:, AMFX protein egression or AMFX protein activity. 
ITieinveiitiohalsd includes conip^ herein. 

In one embbdiiiient, the invention prbvides assays for screening' candidate or test 
10 compounds which bind to v 6r mbdMate tte 

protem or polypeptide br bibk>gically-active pbrtibi* tKerfcbf. 1 Tffie test compounds of the - 
invention: can bb obtained iiisiirg'^ 
itieffiods kri6Wh : in the ^ mcludmgrbm^ 

ph^e or solution ph^^ the - 1 

15 "one-bead bne-cbmpourid ,, libraiyTriethod; md synthetSc libtantf methods using affinity r: - 

chromatography selection: 17 The biblbgicaBibraiy approach is limited to peptide libraries, while 
the other foiir approaches are applicable to peptide^ nbn^pepti&e oligomer or small molecule 
libraries of compounds; See, e.£.y^ 145. 1 
A" "smaU mblbcule" as 1 

20 molecular weight*)? less than abdiit 5 kl> and most preferably less than about 4 kD. Small 

i&blecules can b& 5 ;e.g; 9 nucleic acids, peptides, polypeptides, p^fidomiihetics, carbohydrates, 
lipids orothdr org^nic r br inorganic mbleciilesr.^ Libraries of chemical and/or biological 
mixtures, such as fungal, bacterial; or algal extracts, 1 are known in the art andean be screened 
with my of the assays of the inventions : r 1 

25 Examples of Methods for the synthesis of molecular libraries can be found in the art, for 

example in: DeWitt, et al:, 1 993. Proc. Natl Acad. Set. U.S.A. 90: 6909; Erb, et dl, 1 994. 
Proc Natl. Acad Sci. U.S.A. 91r 1 1422; Zuckerinanii, et al., 1994. J. Med: Chem. 37: 2678; 
Cho, etal., 1993. Science 261: 1303; Carrell, etal, 1994. Angew: Chem. Int. Ed Engl. 33: 
2059; Carell, et a!:, 19941 Angew. Chem. Int: Ed Engl. 33: 2061; and Gallop, etal., 1994. J. 

30 Med. Chem. 37: 1233/ - - r 

Libraries of compounds may be presented in solution {e.g. , Houghten, 1992. 
Biotechniques 13: 412-421), or on beads (Lain, 1991. Native 354: 82-84), on chips (Fodor, 
1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent Nb; 5,223,409), spores (Ladner, 
U.S. Patent 5,233,409), plasmids (Cull, etal., 1992. Proc. Natl. Acad. Sci. USA 89: 1865-1869) 
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or on phage (Scott and Smith, 1 990. Science 249: 386-390; Devlin, 1990. Science 249: 
404-406; Cwirla, et ah, 1990. Proc. Natl. Acad. Set U.S.A. 87: 6378-6382; Felici, 1991. J. 
Mol. Biol. 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
5 membrane-bound form of AMFX protein, or a biologically-active portion thereof, on the cell 
surface is contacted with a test compound and the ability of the test compound to bind to an 
AMFX protein determined. The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the- AMFX protein can be 
accomplished, for example, by coupling the test compound" with a radioisotope or enzymatic 

1 0 label such that binding of the test compound to the AMFX protein or biologically-active 
portion thereof can be determined by detecting the labeled compound in a complex. For 
example, test compounds can be labeled with 125 I, 35 S, 14 C, or 3 H, either directly or indirectly, 
and the radioisotope detected by direct counting of radioemission or by scintillation counting. 
Alternatively, test compounds can be enzymatically-labeled with, for example, horseradish 

1 5 peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 

determination of conversion of an appropriate substrate to product. In one embodiment, the 
assay comprises contacting a cell which expresses a membrane-bound form of AMFX protein, 
or a biologically-active portion thereof, on the cell surface with a known compound which 
binds AMFX to form an assay mixture, contacting the assay mixture with a test compound, and 

20 determining the ability of the test compound to interact with an AMFX protein, wherein 
determining the ability of the test compound to interact with an AMFX protein comprises 
determining the ability of the test compound to preferentially bind to AMFX protein or a 
biologically-active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 

25 expressing a membrane-bound form of AMFX protein, or a biologically-active portion thereof, 
on the cell surface with a test compound and determining the ability of the test compound to 
modulate (e.g. , stimulate or inhibit) the activity of the AMFX protein or biologically-active 
portion thereof. Determining the ability of the test compound to modulate the activity of 
AMFX or a biologically-active portion thereof can be accomplished, for example, by 

30 detennining the ability of the AMFX protein to bind to or interact with an AMFX target 
molecule. As used herein, a "target molecule" is a molecule with which an AMFX protein 
binds or interacts in nature, for example, a molecule on the surface of a cell which expresses an 
AMFX interacting protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu,: a. molecule associated with the internal surface of a cell membrane or a 
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cytoplasmic molecule. An AMFX target molecule can be a non-AMFX molecule or an AMFX ^ 
protein or polypeptide of the invention. In one embodiment, an AMFX target molecule is a 
component of a signal transduction pathway that facilitates transduction of an extracellular <j 
signal (e.g. a signal generated by binding of a compound to a membrane-bound AMFX 
5 molecule) through the cell membrane and intathe cell. T&e target^ for example, can be a "N 
second intercellular protein that has catalytic activity or a prptein-that facilitates the association 
ofdownstream signaling molecules with AMFXv ^ . ^ r . 

Determining the ability of the; AMFX protein. tQifejjjuJ; to ■ oe infract with an AMFX- 
. target molecule can be accomplished by one.oftheime^ods described above for determining 

10 direct binding* Jn:one ^mbodime^t;:deteniii^g^ t^ ability of ihe^AMEX protein to bind to or 
interaet;with an AMFX target molecule- can be_»rompUs^^ the activity of the 

target molecule. For example j.the activity^ the tog^T^leci^cantbe; deteamiiied by v . r : V ? 
detectinfrinduetioikof ace^ > 
diacylglycerol, IP 3 , etc;) ^ detecting: eat^j^c&s^matic* activity-jaf the t^getan appropriate 

15 substrate,, detecting, flie indjuotipn of a r<sp9i1«E;gene;(pompiising an A^^?X-Ile$ponsive: ,;o : 
reguliatoryelementop e.g., 
luciferase^pr defespting a cellular response, for example; cell survival, cellular differentiation, 

Orcell prplifteadonv ^ -;r; ..k-reCH^; *r lesion iiw^vr:: l;v:; i;„;c; -f- 

. In yet another, embodiment, an assay of the invention is a xelMree; assay comprising 
20 contacting an AMFX proteirtor biologically-active portion thereof with a test compound and 

determining theabiUty ofthe.te^compoim^ ~. 
active portion thereof, Binding of the test compounctto the ^ Al^X.protein.can^be d$tennined * 
either directly or indirectly, as described above; , In one such embodiment, the assay comprises 
contacting the, AMFX pjptein : qr : hdologic^ly-aetiye portion thereof , with a known compound 
25 which binds AMFX: tQ form an a^y mixture* contacting tbe assay mixture with a test 

. compound, and determining the ability of the test, compound tpinteract with an AMFX protein, 
wherein determining the ability; of the te^ compound to inter^t >\dth ^ AMFX protein fc 
comprises determining the ability of the test compound to preferentially bind to AMFX or 
biologically-active portion thereof as compared to the known compound. 
30 In still another embodiment, an assay is a cell-free assay comprising contacting AMFX 

protein or biologically-active portion thereof with a test compound and determining the ability 
of the test compound to modulate (e.g. stimulate or inhibit) the activity of the AMFX protein or 
biologically-active portion thereof. Determining the ability; of the test compound to modulate 
the activity of AMFX can be accomplished,, for example, by determining the ability of the 
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AMFX protein to bind to an AMFX target molecule by one of the methods described above for 
determining direct binding. In an alternative embodiment, determining the ability of the test 
compound to modulate the activity of AMFX protein can be accomplished by determining the 
ability of the AMFX protein further modulate an AMFX target molecule. For example, the 
5 catalyticv'enzymatic activity of the target molecule on an appropriate substrate can be 
determined as described, supra. - 

In yet another embodiment, the cell-free assay comprises contacting the AMFX protein 
or biologicallyractive portion thereof with a known compound which binds AMFX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining the 

10 ability of the test compound to interact with an AMFX protein, wherein determining the ability 
of the test compound to interact with an AMFX protein comprises determining the ability of 
the AMFX protein to preferentially bind to or modulate the activity of an AMFX target 
molecule. , ^ ^ : . , ... ... - „- . • 

The cell-free assays of the invention are amenable to use of both the soluble form or the 

1 5 membrane-bound form of AMFX protein-. In the case of cell-free assays comprising the 

membrane-bound form of AMFX protein, it may be desirable to utilize a solubilizing agent 
such that the membrane-bound form of AMFX protein is maintained in solution. Examples of 
such solubilizing agents include non-ionic detergents such as n-octylglucoside, 
n-dodecylglucoside, n-dodecylmaltoside, octarioyl-N-methylglucamide, 

20 decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, Isotridecypoly(ethylene , 
glycol ether) n , N-dodecyl— N,N-dimethy 1-3-ammonio- 1 -propane sulfonate, 
3-(3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may be 

25 desirable to immobilize either AMFX protein or its target molecule to facilitate separation of 
complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to AMFX protein, or interaction of 
AMFX protein with a target molecule in the presence and absence of a candidate compound, 
can be accomplished in any vessel suitable for containing the reactants. Examples of such 

30 vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a 
fusion protein can be provided that adds a domain that allows one or both of the proteins to be 
bound to a matrix. For example, GST-AMFX fusion proteins or GST-target fusion proteins 
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test compound or the 
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test compound and either the non-adsorbed target protein or AMFX protein, and the mixture is 
incubated under conditions conducive to complex formation (e.g., at physiological conditions 
for salt and pH). Following incubation, the beads or microtiter plate wells are washed to 
remove any unbound components, the matrix immobilized in the case of beads, complex 
5 determined either directly or indirectly, for example, as described^ szipra. Alternatively, the 
complexes can be dissociated from the matrix, and the level of AMFX proteurbinding or 
activity determined using standard techniques.: ..... 

r * Other techniques for immobilizing proteins o^ 
screeningiassays of the invention:- For example;: either thei AMFX protein or its target molecule 
10 can be immobilizeiutilizing conjugation of biothxand streptavidirt Biotinylated AMFX 
protein, or target motecules:caii % be;prepared>fr^ 

techniques weH+knswir,vtit^ Rockfbitf, 111), 

and immobilized in the: wells off streptavidin-coated 96, well piates^Pierce Chemical)^ 
Alternatively^ antibodies resactive^wrflt AMFX;protem*>r target molecules, but which da not 

1 5 interfere with binding of the AMFX protein to its target moleculei can! be deriyatized to the 
wells of the plate, and unbound target or AMFX protem trapped iirthe wells byantibody 
conjugation; ' Methods for detecting such complexes^ in addition to tbose'-described above for 
the GST-immobilized complexes, include immunodetection of complexes using antibodies 
reactive .with the AMFX protein^or target moleculei: as well as enzyme^linked assays that rely 

20 on detecting an enzymatic, activity: associated ^with-the AMFX protein or target molecule. 

In another embodiment, modulators of AMFX protein; expression, are identified in a 
method wherein-aceltis contacted with a candidate compound and th& expression of AMFX 
mRNA 6r protein hi 1 the cell is determined:; The level of expression of AMFX mRNA or 
protein; in the presence of the candidate compoundis compared to the level of expression of 

25 AMFX mRNA or protein inthe absence of the candidate compound. The candidate compound 
can then: be identified as a modulator of AMFX mRNA or protein expression based upon this 
comparison. For example; when expression of AMFX mRNA or protein is greater (i.e. t 
statistically significantly greater) in the presence of the candidate compound than in its 
absence, thexandidate compound is identified as a stimulator of AMFX mRNA or protein 

30 expressioru Alternatively, when expression of AMFX mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of AMFX mRNA or protein expression. The level of 
AMFX mRNA of protein expression in the cells can be determined by methods described 
herein for detecting AMFX mRNA or protein. 
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In yet another aspect of the invention, the AMFX proteins can be used as "bait 
proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,3 17; 
Zervos, et aL, 1993. Cell 72: 223-232; Madura, et aL, 1993. J. Biol. Chem. 268: 12046-12054; 
Bartel, etal. 9 1993. Biotechniques 14: 920-924; Iwabuchi, et aL, 1993. Oncogene 8: 
5 1 693-1 696; and Brent WO 94/1 0300), to identify other proteins that bind to or interact with 
AMFX ("AMFX-binding proteins" or "AMFX-bp") and modulate AMFX activity. . Such 
AMFX-binding proteins are also likely to be involved in the propagation of signals by the 
AMFX proteins as, for example, upstream or downstream elements of the AMFX pathway. 
The two-hybrid system is based on. the modular nature/of most transcription factors, 

1 0 which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 
two different DNA constructs. In one construct, the gene that codes for AMFX is fused to a 
gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4)! In the 
other construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified-protein ("prey" or "sample") is fused to a gene that codes for the activation domain 

15 of the known transcription factor. If the "bait" and the "prey" proteins are able to interact, in 
vivo, forming an AMFX-dependent complex, the DNA-binding and activation domains of the 
transcription factor are brought into close proximity. This proximity allows transcription of a 
reporter gene (e.g. , LacZ): that is operably linked to a transcriptional regulatory site responsive : 
to the transcription factor. Expression of the reporter gene can be detected and cell colonies 

20 containing the functional transcription factor can be isolated and used to obtain the cloned gene 
that encodes the protein which interacts with AMFX. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein: 

Detection Assays 

25 Portions or fragments of the cDNA sequences identified herein (and the corresponding 

complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way 
of example, and not of limitation, these sequences can be used to: (/) map their respective 
genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (//) 
identify an individual from a minute biological sample (tissue typing); and (Hi) aid in forensic 

30 identification of a biological sample. Some of these applications are described in the 
subsections, below. 
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Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this * * 
sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the AMFX sequences, SEQ ID v 
5 NOS:l, 3 5 5, 7, 9 5 11,:13;:15, 17 and: 19, or fragments or derivatives thereof, can be used to 
map the location-of the AMFX genes^ respectively, on a chromosome. The mapping of the 
AMFX sequences to chromosomes is ah important first step in correlating these sequences 

with genes associated with- disease^?:- riT. or , :< : 

Briefly, AMFX genes can be mapped to chrompsomesiby preparing PGR primers 
10 (preferably 15^25^bpjinlebag^): from the^AMFX sequenees.s Gomputer analysis- of tiie-AMFX, 
sequences can be used to rapidly: select primes that do:not spartmore than one exoa inthe 
genomic DMA, thus complicating the amplification process. These primers can then be used 
for PCR; screening: of somatic cell hybrids containing individual human chromosomes. jOnly . s 

those hybrids containing the human gene corresponding tb the AMFX sequence will yield an 
15 amplifi^*flbagm«Qtvp:-c::.^:v. ^ :. viiiuihcd by z :.c—znm~ ;• '■■ -s 

■•\ : . Somatic cell hybrids are prepared by fusing somatic cells from different mammals {e.g. , 
human and* moiise;cells)is As hybrids^of human andmouse cells grow and divide, they . , 
gradually lose human chromosomes h^ 

using media in which mouse: cells: cannot grow; because they lack a particular enzyme, but in 
20 which human cells can, theione hunian chromc^ome that contains the gene encoding the 

needed enzyme will be retained. By using various media; panels of hybrid cell lines can be 
established. Each cell line in a panel contains either a single human chromosome or a small 
number of human chromosomes; and a full set of mouse* chromosomes; allowing easy mapping 
of individual genes to specific human chromosomes; :See\ e.g;, ^D'Eustachio, et aL, 1983; 
25 Science 220: 9 1 9-924;; iSomatic- cell hybrids containing only fragments of human : 

chromosomes can also be produced by using human chromosomes with translocations and 

deletions.; \-> . .:: =....:. v-.-iai :~ - _ - 

PCR mappings of somatic cell hybrids; is a rapid procedure for assigning a particular 
sequence to a particular chromosome, Three-or more , sequences can be assigned per day using 
30 a single thermal cycler;. Using the AMFX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose division has been blocked in 
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metaphase by a chemical like colcemid that disrupts the mitotic spindle. The chromosomes 
can be treated briefly with trypsin, and then stained with Giemsa. A pattern of light and dark 
bands develops on each chromosome, so that the chromosomes can be identified individually. 
The FISH technique can be used with a DNA sequence as short as 500 or 600 bases. However, 
5 clones larger than 1 ,000 bases have a higher likelihood of binding to a unique chromosomal 
location with sufficient signal intensity for simple detection. Preferably 1,000 bases, and more 
preferably 2,000 bases, will suffice to get good results at a reasonable amount of time. For a 
review of this technique, see, Verma, et al 9 HumaN;£hRomosomes: A Manual of Basic 
Techniques (Pergamon Press^ New York 1988). / 

10 Reagents for chromosome mapping can be used individually to mark a single 

chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding 
regions of the genes actually are preferred for mapping --piuppses-...; Coding sequences are more 
likely to be conserved within gene families, thus increasing the chance of cross hybridizations 

15 during chromosomal mapping. \ 

Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, e.g., in McKusick, Mendelian Inheritance in Man* available on-line through » 
Johns Hopkins University Welch Medical Library). The relationship between genes and 

20 disease, mapped to the same chromosomal region, can then be identified through linkage 

analysis (co-inheritance of physically adjacent genes), described in, e.g., Egeland, et aL y 1987. 
Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the AMFX gene, can be determined. If a mutation is 

25 observed in some or all of the affected individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves first looking for structural alterations in 
the chromosomes, such as deletions or translocations that are visible from chromosome spreads 
or detectable using PCR based on that DNA sequence. Ultimately, complete sequencing of 

30 genes from several individuals can be performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 
Tissue Typing 

The AMFX sequences of the invention can also be used to identify individuals from 
minute biological samples.. In this technique, an individual's genomic DNA is digested with 
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one or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms " described in U.S: Patent No. 5,272,057). 
Furthermore, the sequences of the invention can be used to provide an alternative 
5 technique that, detenmnesrthe.actaaibas^ DNA sequence of selected portions of an 

individual's genome. Thus, the AMFX sequences deseribed herein can be used to prepare two 
PGR primers from the 5 ? andl3I-tenmni of ^ can then be used to 

amplify ^mdividu^ ^ r ,u !r 

.v Panels of corresponding DNA. seq^gce&£om jndi^ can 
1 0 provide unique indi^ as each mdiviib^imU ha^ ^Mnique of such 

DN^sequencesd^jt^^ 

obtain such ideatjfieatiQn sequences [[from in#yidualsr^d^m 

of ths iayen^ genoa^.AIkli^ye^tipn occurs to 

some degree, in the coding regions of theseiseqi^iices, arid taa greater degree-in the noncoding 
15 regions*- It is estimated that alle^ 

frequency of about oace pe^ea^bi 3QQ {bases;; Much:0f the allelic yaria^on is duq to single ^ 
nucleotide poJy^rpWsms ^ 

Each, of the sequences described herein can* ..tpispm©; 4$gree> berused ast a standard : : 
20 against which EJNA ? from aa individual can comp^ed^for identification : purpp$es. L Beciause 
greater numbers, of polymorphisms .oqcur in the noncoding regions,_fewer sequences are,7 
necessary to differentiate individuals 

positive; individual identification with ap^ehqf p^^ 1^000 piro^ t^eadb yield a 

noncoding amplified sequence of 1 00 basest . If predicted coding sequences^such as those in - 
25 SEQJ®;NG§;1, 3, 5,7, % 11, 13, 15, 17 and 19, are us<^a more appropriate number of 
primers for positive individual identification would be 500-2j000.. : 
Predictive Medicine — ^ 

assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
30 prognostic . (predictive> purposes to ^ thereby treat an indiyidual prophylactically. Accordingly, 
one aspect of the invention relates to diagnostic assays for determining AMFX protein and/or 
nucleic add expression as* well as AMFX activity, in the context of a biological sample (e.g. , 
blood; serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of d^yeloping.a disorder, associated with aberrant AMFX 
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expression or activity. The disorders include e.g. , disorders related to cell signal processing, 
cell adhesion or migration pathway modulation, for example, but not limited to, 
chemoresistance, radiotherapy resistance, survival in trophic factor limited secondary tissue 
site microenvironments, connective tissue disorders, tissue remodeling, oncogenesis, cancer of 
5 the breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, bladder, kidney, brain, 
and soft-tissue, cellular transformation, developmental tissue remodeling, inflammation, blood 
clot formation and resorption, hematoppiesis, angiogenesis, multidrug resistance related to 
organic anion transporters, malignant disease progression, autocrine and paracrine regulation 
of cell growth, cellular responses to external stimuli, wasting disorders associated with chronic 

1 0 diseases and various cancers. The invention also provides for prognostic (or predictive) assays 
for determining whether an individual is at risk of developing a disorder associated with 
AMFX protein, nucleic acid expression or activity. For example, mutations in an AMFX gene 
can be assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to thereby prophy tactically treat an individual prior to the onset of a disorder 

1 5 characterized by or associated with AMFX protein, nucleic acid expression, or biological 
activity.. . 

Another aspect of the invention provides methods for determining AMFX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as ^ pharmacogenomics ,, ). 
20 Phaimacogenomics allows for the selection of ageaits (e.g., drugs) for therapeutic or 

prophylactic treatment of an individual based on the genotype of the individual (e.g., the 
genotype of the individual examined to determine the ability of the individual to respond to a 
particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g. , 
25 drugs, compounds) on the expression or activity of AMFX in clinical trials. 

These and other agents are described in further detail in the following sections. 
Diagnostic Assays 

An exemplary method for detecting the presence or absence of AMFX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
30 sample with a compound or an agent capable of detecting AMFX protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes AMFX protein such that the presence of AMFX is 
detected in the biological sample. An agent for detecting AMFX mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to AMFX mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length AMFX nucleic acid, such as the nucleic 
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acid of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17 and 19, or a portion thereof, such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to, 
specifically hybridize under stringent conditions to AMFX mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 
5 An agent for detecting AMFX protein is ?tn- antibody capable of binding to AMFX 

protein* preferably aaantibody with a detectable.labeL Antibodies can be polyclonal, or more 
preferably, monoclonal.. An intopt ^tibody, or a fragment F(ab'>2) can be 

used- : f The term I'la^kdlV-vyg^i regard to the probe or antibody, is intended to encompass direct 
labeling of the. pro^ to 

10 the-p^pbe ,_or antibody, M /: w;elt; as - 3 indu^,^^g. of the probe. or antibody by -.reactivity with 
another, re^eatthat ia directly labejed,, .^^pl^^f^dir^jabeling incline detection of a. 
prim^jmtih^^ of a DNA 

prQte^^:bio^.suc^th£^ itc^^ deje^ ; The tenn 

"biological sai^e«^ : i^^<tei to i^clucfejtfcwgSi sells , and^biologi^ fluids isolated from a 

15 subject, ^ well, a&t^ 

method.of Ae^yentioii ,can be used, iq^detect AM£X inl^A, p^otein, or, genomic DNA in a 
biological sample An yitro as ■ weU as //» viyo. For .example, yifratecluiiques for detection of 
AtylFX mBJNA. include Nor^ro hybrid^ vitro techniques 

for detection: of AMF X protein include enzyin^ l^^immuiio^ 

20 Westeniblo^^ 

detectionof AMFX genomic PNA include Southern hybridizations, ^Furthermore, in vivo 
techniques for detec^pn.of AMFX protein include introducing into a subject a labeled 
anti-AMFX antibody^ Por, example, the antibody can be labeled with a radioactive marker 
whose presence and location in a subject, can be.deteeted by standard imaging techniques. 

25 . ; In one embodiment, the biological sample contains protein molecules from the test 

subject, ^ternatiyely, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from. the test subject . .A preferred biological sample is a 
peripheral blood leukocyte sample isolated by conventional means from a subject 

In another embodiment^ the methods further involve obtaining a control biological 

30 sample from a control subject, contacting thecontrol sample with a compound or agent capable 
of.detecting AMFX protein* mRNA, or genomi^.DNAj sjich. tiiat the presence of AMFX 
protein, mRNA or genomic DNA is detected in the biological sample, and comparing the 
presence of AMFX protein, mRNA or genomic DNA in.thp, control sample with the presence 
of AMFX protein, mRNA or genomic DNA in the test sample. 
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The invention also encompasses kits for detecting the presence of AMFX in a 
biological sample. For example, the kit can comprise: a labeled compound or agent capable of 
detecting AMFX protein or mRNA in a biological sample; means for determining the amount 
of AMFX in the sample; and means for comparing the amount of AMFX in the sample with a 
5 standard. The compound or agent can be packaged in a suitable container. The kit can further 
comprise instructions for using the kit to detect AMFX protein or nucleic acid. 
Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant AMFX 

1 0 expression or activity. : For example,; the assays described herein, such as the preceding 

diagnostic assays or the following assays, can be utilized to identify a subject having or at risk 
of developing a disorder associated with AMFX protein,, nucleic acid expression or activity. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for 
developing a disease or disorder. Thus, the invention provides a method for identifying a 

1 5 disease or disorder associated with aberrant AMFX expression or activity in which a test 

sample is obtained from a subject and AMFX protein or nucleic acid {e.g. ., mRNA, genomic 
DNA) is detected, wherein the presence of AMFX protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or disorder-associated with aberrant AMFX 
expression or activity. As used herein, a "test sample" refers to a biological sample obtained 

20 fromasubject of interest; For example, atest sample can be a biological fluid (e.g., serum), 
cell sample, or tissue.- . . 

Furthermore, the prognostic assays described herein can be used to determine whether a 
subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid,- small molecule, or other drug candidate) to treat a disease or disorder 

25 associated with aberrant AMFX expression or activity. For example, such methods can be 
used to determine whether a subject can be effectively treated with an agent for a disorder. 
Thus, the invention provides methods for determining whether a subject can be effectively 
treated with an agent for a disorder associated with aberrant AMFX expression or activity in 
which a test sample is obtained and AMFX protein or nucleic acid is detected (e.g., wherein 

30 the presence of AMFX protein or nucleic acid is diagnostic for a subject that can be 

administered the agent to treat a disorder associated with aberrant AMFX expression or 
activity). 

The methods of the invention can also be used to detect genetic lesions in an AMFX 
gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
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characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a" 1 
genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an AMFX^protein, or the misexpression of the AMFX gene. For example, such 
5; genetic lesions can be detected by ascertaining the existence of at least one of: (i) a deletion of 
one or more nucleotides from an? AMFX gene; (ii) an addition of one or more nucleotides to an 
AMFX gene; (iii) a substitution of one or more nucleotides of an AMFX gene, (iv) a 
chromosomal rearrangement of an AMFX gene; (v) an alteration in the level of a messenger 
RNA transcript of an AMFX gene, (W) aberrant m<*iificatioh of an AMFX gene, such as of the 
1 0 methylation pattern of die genomic -DNA£ the presenefc he k rft>n-wild-^pS ^plidihg pattern 
of aifiess^ 6f an AMFX' 

protein,- (cc)1allelic r toss of ian AMFX'genei- ahd^xy fitepj^^ 

of art AMFXproteinl 1 As described herem 4hfe»3Se * larffe riuMbcr 5 asSay t^hhiqft^^krfewri 1 
in the art ^chHj^ A j^ftfaired biological 

15 sampteis a peripheral Wood letikbc^satmple isolated by c&hvefttidrial ftte&iis fttirtfa subject;' 1 
However, any bioldgic^ c s^ 

example bub^Mueo^'cglSii- 01 -- l^'ouu svcmauou a*'** — -u-i:, :;^;:-uu. ? :?oi— ;- r 
In certain embbdMe^ 

polymerase chain reaction {FGS^'(see 9 &gi\ U ;E P&ehtNosr. '4,683^195 atnd '4,683^202), such as 
20 anchor PCR or RACE PGR, or, alternatively; in ligation chain reaction (LCR) (see, e.g.% 
Landegran, et al: y 19&& Science 241: 1077-108(^ ahd Nakaz^w^ eial% \994Prdc. Natt 
Acad Set USA 91* 360^364}/the4atter of which ean-te-partieifliariy u^e^fbr detecting point 
mutations in tiie AMFX^gene (i?ee, Abravaya; et aL; \995\-Nucl Acids Res. 23: 675-682). This 
method can include the steps 6f collecting a si^ple of cells-ftbm ai patient; isolating nucleic 
25 acid (e.g.- genomic; mRNAw both) from the cells of the sample, contacting the nucleic acid 
sample wth one or more primers that specifically hybridize to aitAMFX gene under 
conditions such that hybridization and amplification of the AMFX gene (if present) occurs, and 
detecting the presence or absence of an amplification product,' or detecting the size of the 
amplification product and comparing the-length to a control sample. It is anticipated that PCR 
30 and/or LCR may be desirable to use as a preliminary amplification step in conjunction with 
any of the techniques used for detec^g inutatibM described heieinr^^^^^-- 1 

Alternative amplification methods include: self sustained sequence replication (see, 
Guatelli, era/., 1990; Proc Natl Acad Set USA 87: 1874-1878), transcriptional amplification 
system (see, Kwoh, et ah, 1989. Prdc. Natl Acad Set 86: 1 173-1177); Qp Replicase 
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(see, Lizardi, et al, 1988. BioTechnology 6: 1 197), or any other nucleic acid amplification 
method, followed by the detection of the amplified molecules using techniques well known to 
those of skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an AMFX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel -electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in 
the sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g., U.S. Patent No. 
5,493,53 1) can be used to score for the presence bf specific mutations by development or loss 
of a ribozyme cleavage site. .-. - ;u -_ : - ^ - ........ 

In other embodiments, genetic mutations in AMFX can be identified by hybridizing a 
sample and control nucleic acids, e.g. y DNA or RN A, to high-density sprays containing 
hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et aL, 1996. Human 
Mutation 7: 244-255; Kozal, et ah, 1996. Nat Med. 2: 753-759. For example, genetic 
mutations in AMFX can be identified in two dimensional arrays containing light-generated 
DNA probes as described in Cronin, et al., supra. Briefly, a first hybridization array of probes 
can be used to scan through long stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of sequential overlapping probes. This 
step allows the identification of point mutations. This is followed by a second hybridization 
array that allows the characterization, of specific mutations by using smaller, specialized probe 
arrays complementary, to all variants or mutations detected. Each mutation array is composed 
of parallel probe sets, one complementary to the wild-type gene and the other complementary 
to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the art 
can be used to directly sequence the AMFX gene and detect mutations by comparing the 
sequence of the sample AMFX with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on techniques developed by Maxim and 
Gilbert, 1977. Proa Natl. Acad ScL USA 74: 560 or Sanger, 1977. Proa Natl Acad Sci. USA 
74: 5463. It is also contemplated that any of a variety of automated sequencing procedures can 
be utilized when performing the diagnostic assays (see, e.g., Naeve, et al., 1995. Biotechniques 
19: 448), including sequencing by mass spectrometry (see, e.g., PCT International Publication 
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No. WO 94/16101; Cohen, etaL y 1996. Adv. Chromatography 36: 127-162; and Griffin, et al., 
1993. Appl Biochem. Biotechnol 38: 147-159). 

Other methods for detecting mutations in the AMFX gene include methods in which v 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
.5 RNA/DNA heteroduplexes. See, e.g., Myers, et al, 1985: Science 230:: 1242. In general, the 
art technique of "mismatch cleavage' 1 starts by providing heteroduplexes of formed by * 
hybridizing (labeled) RNApr DNA containing the wild-type AMFX sequence with potentially 
mutant RNA or DNA obtained from a tissue sampler The double^strahded duplexes are treated 
with an agent that i cleave&single^str^ded^^fegions of the duplex:such as which will exist due to 

10 basepak mismatches; betw^n Ae control and sa^ 

duplexes can be;treated with RNase and DNA/DNA hybrids: treated with Si nuclease to 
enzymatically digesting, the mismatched regions. In other embodiments, either DNA/DNA or 
RN A/DNA duplexes can bertreated ^tK,hydroxylamirie~or osmium tetroxide and with :i - 
piperidine in orderto digest mismatched rSgionss; After digestion : of the mismatched regions, 

1:5 therresulting material; is then separated ;by size on deaaturing pplyac^lamide gels to determine 
the site of mutation, . See, eg.^Gotton, etqL,J9&8:PKOc:NattA 

Saleeba, et al, 1992. Methods Enz^ok^W In an embodiment^ the control DNA or 

RNA can labeled, for detection; -, , . r >.. r „ t ; ; . ; • \ . 

: In still another, embodiment, the. mismatch cleavage reaction employs one or more 

20 proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 

mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
AMFX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T 
at G/T mismatches; See; e.g., HsvL,etaL T 1994. Carcinogenesis J 5: 1657-1662. According to 

25 an exemplary embodiment* a probe based on an AMFX sequence, e.g. , a wild-type AMFX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in eleetrophoretic mobility will be used to identify 

30 mutations in AMFX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences iaeiectrophoretic.mobility between mutant and wild type 
nucleic acids. See, e.g., Qrita, et ah, 1989. Proc. Natl Acad ScL USA: 86: 2766; Cotton, 1993. 
Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal Tech: Appl 9: 73-79. Single-stranded 
DNA fragments of sample and control AMFX nucleic acids will be denatured and allowed to 
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renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of even a 
single base change. The DNA fragments may be labeled or detected with labeled probes. The 
sensitivity of the assay may be enhanced by using RNA (rather than DNA), in which the 
5 secondary structure is more sensitive to a change in sequence. In one embodiment, the subject 
method utilizes heteroduplex analysis to separate double stranded heteroduplex molecules on 
the basis of changes in electrophoretic mobility. See, e.g., Keen, et al, 1991. Trends Genet. 7: 

In yet another embodiment, the movement of mutant or wild-type fragments in 

1 0 polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient 
gel electrophoresis (DGGE). See, e.g., Myers, et al, 1985. Nature 313: 495. When DGGE is 
used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for . example by adding a GC clamp of approximately 40 bp of high-melting GC-rich 
DNA by PCR. . In a further embodiment, a temperature gradient is used in place of a 

1 5 denaturing gradient to identify differences in the mobility of control and sample DNA. See, 
e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization,. selective amplification, or selective primer 
extension. . For example, oligonucleotide primers may be prepared in which the known 

20 mutation is placed centrally and then hybridized to target DNA under conditions that permit 
hybridization only if a perfect match is found. See, e.g., Saiki, et al, 1986. Nature 324: 163; 
Saiki, et al 9 1989. Proc. Natl Acad. Sci. USA 86: 6230. Such allele specific oligonucleotides 
are hybridized to PCR amplified target DNA or a number of different mutations when the 
oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target 

25 DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the 
molecule (so that amplification depends on differential hybridization; see, e.g., Gibbs, et aL, 
30 1989. Nitcl. Acids Res. 17: 2437-2448) or at the extreme 3 f -terminus of one primer where, 

under appropriate conditions, mismatch can prevent, or reduce polymerase extension {see, e.g., 
Prossner, 1993^ Tibtech. 1 1 : 238). In addition it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection. See, e.g., 
Gasparini, et al, 1992. Mol Cell Probes 6: 1 . It is anticipated that in certain embodiments 
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amplification may also be performed using Tag ligase for amplification. See, e.g., Barany, 
1 99 1 . Proc. Natl Acad, Set USA 88:189. In such cases, ligation will occur only if there is 'a 
perfect match at the 3-terminus of the 5' sequence, making it possible to detect the presence of 
a known mutation at a specific site by looking for the presence or absence of amplification. 

The methods described herein may beperformed, for example, by utilizing 
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose patients 
exhibiting symptonis or family history of a disease or illness involving an AMFX gene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
AMFX is expressed may be utilized in the prognostic assays described herein. However, any 
biological sample containing nucleated 1 cells may be used*, including^ for examples buccal 
mucosal cells.: - ' :■" z:r::: . .:z::z:r~\ tt:: . .-.;"':r;: a 

Pharmacogenomicsc^^z^:^ _ ':t:.v:. • '-■Sr Z% ^-.;zfi^-;?? ■ 

Agent§5-or modulators that have a stimulatory ^pr inhibitory ef^ on AMFX activity 
(e.g. , AMFX gene expfessibn), as identified ^ be 
administered to iri^viduali^ta treat (prbphylactically or therapeutically) disorders (The 
disorders include, e.g. , disorders related to cell signal processing, cell adhesion or migration 
pathway modulation, for example; but not innited fo, chemoresistance, radiotherapy resistance, 
survival in trophic factor limited secondary tissu^ sitenricroenvironments, connective tissue 
disorders, tissue remodeling, oncogenesis, cancer of the breast, ovary, cervix, prostate, _\ 
endometrium, stomach, colon, lung, bladder, kidney, brain, and soft-tissue, cellular 
transformation, developmental tissue remodeling, inflammation, blood clot formation and 
resorption,- hematopoiesis, angiogenesis, multidrug resistance related to organic anion 
transporters, malignant disease progression, autocrine and paracrine regulation of cell growth, 
cellular responses to external stimuli, and-wasting disorders associated with chronic diseases 
and various cancers.) In conjunction with such treatment, the pharmacogenomics (i.e., the 
study of the relationship between an individual's genotype:and that individual's response to a 
foreign compound or drug) of the individual may be considered. Differences in metabolism of 
therapeutics can lead to severe toxicity or therapeutic failure by altering the relation between 
dose and blood concentration of the pharmacologically active drug. Thus, the 
pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a consideration of the individual's genotype. 
Such pharmacogenomics can further be used to determine appropriate dosages and therapeutic 
regimens. Accordingly , the activity of AMFX protein, expression of AMFX nucleic acid, or 
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mutation content of AMFX genes in an individual can be determined to thereby select 
appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. See 
5 e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol Physiol, 23: 983-985; Linder, 1997. Clin. 
Chem. , 43: 254-266. In general, two types of pharmacogcnetic conditions can be 
differentiated. Genetic conditions transmitted as a single factor altering the way drugs act on 
the body (altered drug action) or genetic conditions J^smitted as single factors altering the 
way the body acts on drugs (altered drug metabolism). These pharmacogenetic conditions can 
1 0 occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate 

dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
15 determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g. , N-acetyltransferase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2D6 and CYP2C 1 9) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug! These polymorphisms are 
20 expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
metabolizer (PM). The prevalence of PM is different among differex^Upopulations. For 
example, the gene coding for CYP2D6 is highly polymorphic and several mutations have been 
identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side 
25 effects when they receive standard doses. If a metabolite is the active therapeutic moiety, PM 
show no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 
30 Thus, the activity of AMFX protein, expression of AMFX nucleic acid, or mutation 

content of AMFX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 
drug-metabolizing enzymes to the identification of an individual's drug responsiveness 
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phenotype. This knowledge, when applied to dosing or drag selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when 
treating a subject with an AMFX modulator, such as a modulator identified by one of the 
exemplary screening assays described herein. 

Monitoring of Effects During Clinical Trials ? = ' - 

- Monitoring the influence of agents (e g. , drugs; compounds) on the expression or - 
activity of AMFX gather abflity to modulate aberrant cell proliferation and/or 
differentiation}, can be applied natori>£,in,b^ in clinical trials. For 

example, the effectiveness of an agent determined by a screening assay as described herein to 
increase AMFX gen^ 
monitored in clihicai:tj^ 
levels, or dowi^^ 

determined by a s©re*enint£ assay to deerease r AM^;g£S^exp^ 

downregulate AMIX^cth^ . 
increased AMFX gene expression, protein levels, or upregulated AMFX activity. In such 
clinical trials, the expressionfor^^ and, preferably; other genes that have been 

implicated in, for" example, a cellidaFprolife^ or immune disorder can be used as a "read 
out" or markers of the immune responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including AMFX, that are modulated 
in cells by treatment vrith an ''agextffegr, 66mp6\M^d^'or small m6IecuIe)^aV modulates 
AMFX activity (e:g: 9 identified ih k screetii&g askay as de^ribfed bei^in) cah be identified. 
Thus, to study the effect of agents on cellular proliferation disorders, for example, in a clinical 
trial, cells can be isolated and -RNA prepared and aiialyzedfc^the levels^of expression of 
AMFX and bther genes implicated in the disorder; The levels of gene expression (r e. ,~ a gene 
expression pattern) can b&j quantified by Northern blot analysfcor RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced^ by one of the methods as 
described herein, or by measuring the levels of activity of AMFX or other genes. In this 
manner, the gene expression pattern can serve as a marker, indicative of the physiological 
response of the cells to the agent. Accordingly, this response state may be determined before, 
and at various points during,- treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness 
of treatment of a subject with an agent (e.g. , an agonist, antagonist, protein, peptide, 
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the 
screening assays described herein) comprising the steps of (/) obtaining a pre-administration 
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sample from a subject prior to administration of the agent; (//) detecting the level of expression 
of an AMFX protein, mRNA, or genomic DNA in the preadministration sample; (Hi) obtaining 
one or more post-administration samples from the subject; (iv) detecting the level of expression 
or activity of the AMFX protein, mRNA, or genomic DNA in the post-administration samples; 
5 (v) comparing the level of expression or activity of the AMFX protein, mRNA, or genomic 
DNA in the pre-administration sample with the AMFX protein, mRNA, or genomic DNA in 
the post administration sample or samples; and (vi) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be desirable 
to increase the expression or activity of AMFX to higher levels than detected, ie. 9 to increase 
10 the effectiveness of the agent. Alternatively, decreased administration of the agent may be 
desirable to decrease expression or activity of AMFX to lower levels than detected, i.e., to 
decrease the effectiveness of the agent. 

Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
1 5 subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
AMFX expression or activity. The disorders include, e.g., disorders related to cell signal 
-processing, cell adhesion or migration pathway modulation, for example, but not limited to, 
chemoresistance, radiotherapy resistance, survival in trophic factor limited secondary tissue 
site microenvironments, connective tissue disorders, tissue remodeling, oncogenesis, cancer of 
20 the breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, bladder, kidney, brain, 
and soft-tissue, cellular transformation, developmental tissue remodeling, inflammation, blood 
clot formation and resorption, hematopoiesis, angiogenesis, multidrug resistance related to 
organic anion transporters, malignant disease progression, autocrine and paracrine regulation 
of cell growth, cellular responses to external stimuli, and other diseases, disorders and 
25 conditions of the like. 

These methods of treatment will be discussed more fully, below. 
Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with- 
30 Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antEgcinize.. 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may 
be utilized include, but are not limited to: (/) an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; (ii) antibodies to an aforementioned peptide; (///) 
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nucleic acids encoding an aforementioned peptide; (iv) administration of antisense nucleic acid 
and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the 
coding sequences of coding sequences to an aforementioned peptide) that are utilized to 
"knockout" endogenous, function of an aforementioned peptide by homologous recombination 
5 (see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, 

agonists and antagonists, including additional peptide mimetic of the invention or antibodies 
specific . to a peptide of the invention) that alter the interaction between an aforementioned 
peptide and tik binding piteier. 

Diseases and disorders that are characterized by decreased (relative to a subject not 

10 suffering from the disease or disorder)-feVels or biological activity may be treated with 

Therapeutics: that increase (/. e. , are agonists to) activity.- Therapeutics that upregulate activity 
may ; be ^ administered in a therapeutic or prophylactic manner. Therapeutics that may be 
utilized include, but are not limited to, an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; or an agonist that increases bioavailability. 

15 x : ft Increased or decreased levels can be readily detected by quantifying peptide and/or 

KNA, by obtaining a patient tissue sampler (e.g., from biopsy tissue) and assaying it in vitro for 
KNA or pejptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known; within the art include, but are not 
limited to, immunoassays (e.g., by Western blot analysjk, immunoprecipitation followed by 

20 sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immimocytochemistry, etc.) 
and/or hybridization assays to detect expression of mRNAs (e.g, Northern assays, dot blots, in 
situ hybridization, and the like). 

ProphylaeticJxkthods - 

In one ^^ecCtbe invention provides a method for preventing^in a subject, a disease or 
25 condition associated with an aberrant AMFX expression or activity, by administering to the 
subject aiiL ; agent : that modulates AMFX expression or at least one AMFX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant AMFX expression or activity can 
be identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the manifestation 
30 of symptoms characteristic of the AMFX aberrancy, such that a disease or disorder is 

prevented or, alternatively, delayed in its progression.. Depending upon the type of AMFX 
aberrancy, for example, an AMFX agonist or AMFX antagonist agent can be used for treating 
the subject. The appropriate agent can be determined based on screening assays described 
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herein. The prophylactic methods of the invention are further discussed in the following 
subsections. 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating AMFX expression 
5 or activity for therapeutic purposes. The modulatory method of the invention involves 

contacting a cell with an agent that modulates one or more of the activities of AMFX protein 
activity associated with the cell. An agent that modulates AMFX protein activity can be an 
agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate 
ligand of an AMFX protein, a peptide, an AMFX peptidomimetic, or other small molecule. In 

1 0 one embodiment, the agent stimulates one or more AMFX protein activity. Examples of such 
stimulatory agents include active AMFX protein anda nucleic acid molecule encoding AMFX 
that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
AMFX protein activity. Examples of such inhibitory agents include antisense AMFX nucleic 
acid molecules and anti-AMFX antibodies. These modulatory methods can be performed in 

1 5 vitro {e.g. , by culturing the cell with the agent) or, alternatively, in vivo (e.g. , by administering 
the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an 
AMFX protein or nucleic acid molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described herein), or 

20 combination of agents that modulates (e.g. , up-regulates or down-regulates) AMFX expression 
or activity; In another embodiment, the method involves administering an AMFX protein or 
nucleic acid molecule as therapy to compensate for reduced or aberrant AMFX expression or 
activity. 

Stimulation of AMFX activity is desirable in situations in which AMFX is abnormally 
25 downregulated and/or in which increased AMFX activity is likely to have a beneficial effect. 
One example of such a situation is where a subject has a disorder characterized by aberrant cell 
proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease (e.g. , preclampsia). 

Determination of the Biological Effect of the Therapeutic 

30 In various embodiments of the invention, suitable in vitro or in vivo assays are 

performed to determine the effect of a specific Therapeutic and whether its administration is 
indicated for treatment of the affected tissue. 
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In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 
the desired effect upon the cell type(s). Compounds for use in therapy may be tested in 
suitable animal model systems including, but not limited to rats, mice, chicken, cows, 
5 monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo testing, 
any of the animal model system known in the art may be used prior to administration to human 
subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

The AMFX nucleic acids and proteins of the invention are useful in potential- : 

10 prophylactic and therapeutic applications implicated in a variety of disorders including, e.g. , 
disorders related to cell signal processing, cell adhesion or migration pathway modulation, for 
example, but not limited to* chemoresistance, radiotherapy resistance^survival in trophic factor 
limited secondary tissue site miax>envirbriments, connective tissue disorders, tissue T 
remodeling, oncogenesis, cancer of the breast, ovary, cervix, prostate, endometrium, stomach, 

1 5 colon, lung, bladder, kidney, brain, and soft-tissue, cellular transformation, developmental 
tissue remodeling, inflammation, blood clot formation and resorption, hematopoiesis, 
angiogenesis, multidrug resistance related to organic anion transporters, malignant disease 
progression, autocrine and paracrine regulation of cell growth, cellular ^responses to external 
stimuli, disorders associated with chronic diseases and various cancers: ■ 

20 As an example, ^ cDNA encoding the AMFX protein of the invention may be useful in 

gene therapy, and the protein may be useful when administered to a subject in need thereof. 
By way of non-limiting example, the compositions of the invention will have efficacy for 
treatment of patients suffering from: e.g., disorders related to cell signal processing, cell 
adhesion or migration pathway modulation, for example; but not limited to, chemoresistance, 

25 radiotherapy resistance, survival in trophic factor limited secondary tissue site 

microenvironments, connective tissue disorders, tissue remodeling, oncogenesis, cancer of the 
breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, bladder, kidney, brain, and 
soft-tissue, cellular transformation, developmental tissue remodeling, inflammation, blood clot 
formation and resorption, hematopoiesis, angiogenesis, multidrug resistance related to organic 

30 anion transporters, malignant disease progression, autocrine and paracrine regulation of cell 
growth, cellular responses to external stimuli. 

Both the novel nucleic acid encoding the AMFX protein, and the AMFX protein of the 
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
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presence or amount of the nucleic acid or the protein are to be assessed. A further use could be 
as an anti-bacterial molecule (i.e., some peptides have been found to possess anti-bacterial 
properties). These materials are further useful in the generation of antibodies which 
immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
5 diagnostic methods. 

EXAMPLES 

The following examples illustrate by way of non-limiting example various aspects of 
the invention. 

Example 1. Quantitative expression analysis of AMF-1-10 in various cells and tissues 

10 The quantitative expression patterns, of clones AMF- 1-10 were assessed in a large 

number of normal and tumor sample cells and cell lines by real time quantitative PCR 
(TaqMan®) performed on a Pericin-Elmer Biosystems ABI PRISM® 7700 Sequence Detection 
System. 

First, 96 RNA samples were normalized to 0-actin and GAPDH. RNA (~50 ng total or 

15 -1 ng polyA+) was converted to cDNA using the TaqMan® Reverse Transcription Reagents 
Kit (PE Biosystems, Foster City, CA; Catalog No. N808-0234) and random hexamers 
according to the manufacturer's protocol.. Reactions were performed in 20 ul and incubated 
for 30 min. at 48°C. cDNA (5 ul) was then transferred to a separate plate for the TaqMan® 
reaction using p-actin and GAPDH TaqMan® Assay Reagents (PE Biosystems; Catalog Nos. 

20 4310881E and 4310884E, respectively) and TaqMan® universal PCR Master Mix (PE 

Biosystems; Catalog No. 4304447) according to the manufacturer's protocol. Reactions were 
performed in 25 ul using the following parameters: 2 min: at 50°C; 10 min. at 95°C; 1 5 sec. at 
95°C/1 min. at 60°C (40 cycles). Results were recorded as CT values (cycle at which a given 
sample crosses a threshold level of fluorescence) using a log scale, with the difference in RNA 

25 concentration between a given sample and the sample with the lowest CT value being 

represented as 2 to the power of delta CT. The percent relative expression is then obtained by 
taking the reciprocal of this RNA difference and multiplying by 1 00. The average CT values 
obtained for B-actin and GAPDH were used to normalize RNA samples. The RNA sample 
generating the highest CT value required no further diluting, while all other samples were 

30 diluted relative to this sample according to their p-actin /GAPDH average CT values. 

Normalized RNA (5 ul) was converted to cDNA and analyzed via TaqMan® using One 
Step RT-PCR Master Mix Reagents (PE Biosystems; Catalog No. 4309169) and gene-specific 
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primers according to the manufacturer's instructions. Probes and primers were designed for 
each assay according to Perkin Elmer Biosystem's Primer Express Software package (version I 
for Apple Computer's Macintosh Power PC) or a similar algorithm using the target sequence 
as input. Default settings were used for reaction conditions and the following parameters were 
5 set before selecting primers: primer concentration = 250 nM, primer melting temperature (T m ) 
range = 58°-60° C, primer optimal Tm = 59° C, maximum primer difference = 2° C, probe does 
not have 5* G, probe T m must be 10° C greater than primer T m , amplicon size 75 bp to 100 bp. 
The probes and primers selected (see below) were synthesized by Synthegen (Houston, TX, 
USA). Probes were double purified by HPLC to remove uncoupled dye and evaluated by mass 

10 spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of the 
probe, respectively. Their final concentrations were: forward and reverse primers, 900 nM 
each, and probe, 200nM. 

PGR conditions: Normalized RNA from each tissue and each cell line was spotted in 
each well of a 96 well PCR plate (Perkin Elmer Biosystems). PCR cocktails including two 

15 probes (a probe specific for tlie target clone and another gene-specific probe multiplexed with 
the target probe) were set up using IX TaqMan™ PCR Master Mix for the PE Biosystems 
7700, with 5 mM MgC12, dNTPs (dA, G, C, U at 1 : 1 : 1 :2 ratios), 0.25 U/ml AmpliTaq Gold™ 
(PE Biosystems), an,d,0.4 U/jil RNase inhibitor, and 0.25 U/jxl reverse transcriptase. Reverse 
transcription was performed at 48° C for 30 minutes followed by amplification/PCR cycles as 

20 follows: 95° C 1 0 min, then 40 cycles of 95° C for 1 5 seconds, 60° C for 1 minute. 

AMF-1 : 



The nucleotide sequence used for TaqMan analysis on AMF-1 is indicated in Table 12. 
The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. 

25 Table 12. AMF-1 (1429510) Sequence Input for TaqMan Analysis 

(reverse strand of SEQ ID NO. 1) : 

CGGATGACTCCCGAGJUVGGTC^GCCCCT 

ACTGCCTGGGTCTCTGMTTTCAGGCTC 

CTTTTGTCCCCTTGGGGCCTTGAGTGTGGC 
30 CTGCCATCCGCAGACGTGAATGAGTGTGCAGAGAACCCTGGCGTCTGG^^ 

GATCCTTCCGCTGTGAGTGTCCCTTTGGCT7\CAGCCTC 

CTCTGTCGGCCACCCCTGTGGGG&AGGGAGk^ 

TTTGA GCCTGGCCT qATGATtiAUUTGC^ 

GCCACAATACCGAGGGCTCCTACCTGTGCACCTGTC 
35 AGATGTGGACGAGTGTGCAGATGGTCAGCAGGACTGCCACGCCCGGGGCAT^ 

TTCGCGTGCGTCTGTCCCCCAGGCATGCGGCCCCTGCCTGGCTCTGGGGAGGGCTGCACAGATGACAATGAATG 

ACGCTCAGCCTGACCTCTGTGTCAACGGCCGCTGTGTCAACACCGCGGGCAGCTTCCGGTGCGACTGTGATGAGGG 

ATTCCAGCCCAGCCCCACCCrrTACCGAGTGCCAC^ 

ATGTGCCGGTCTCTGTCCAGC^GCAGTGAGGCTGTCACCAGGGCCGAGTGCTGCTGTGGGGGTGGCCGGGGCTGGG 
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10 



15 



45 



GGCCCCGCTGCGAGCTCTGTCCCCTGCCCGGCACCTCTGCCTACAGGAAGCTGTGCCCCCATGGCTCAGGCTACAC 
TGCTGAGGGCCGAGATGTAGATGAATGCCGTATGCTTGCTCACCTGTGTGCTCATGGGGAGTGCATCAACAGCCTT 
GGCTCCTTCCGCTGCCACTGTCAGGCCGGGTACACACCGGATGCTACTGCTACTACCTGCCTGGATATGGATGAGT 
GCAGCCAGGTCCCCAAGCCATGTACCTTCCTCTGCAAAAACACGAAGGGCAGTTTCCTGTGCAGCTGTCCCCGAGG 
CTACCTGCTGGAGGAGGATGGCAGGACCTGCAAAGACCTGGACGAATGCACCTCCCGGCAGCACAACTGTCAGTTC 
CTCTGTGTCAACACTGTGGGCGCCTTCACCTGCCGCTGTCCACCCGKS^ 

ACAATGATGAGTGCTC^GCCCAGCCTGGGCCATGTGGTGCCCACGGGCACTGCCACAACACCCCGGGCAGCTTCCG 
CTGTGAATGCCACCAAGGCTTCACCCTGGTCAGCTCAGGCCATGGCTGTGAAGATGTGAATGAATGTGA 
CACCGCTGCCAGCATGGCTGTCAG^ 

CCCAGTGGGCCC^GTGTGTGGGT^GTGAAAAGGGCTGGGAAGAAGCTGGGCCCTCCACCAGAATCTGCTCAGAGC 
AGGCGAC TAACAGACGCCAC CCTGCAAG ATGATGTG ACAAG CACAATT ATCTAAAGAT"TGAAC AGG GCAGC CC AGA 
AGATGAGAATGAGTGTGCCCTGTCGCCC (SEQ ID NO. 21) 

The following primer and probe sequences were used for TaqMan analysis of AMF-1. 

Ag 390 (F) : 5 ■ -ACCAATGTCATCGGAGGCTT-3 1 (SEQ ID NO. 22) 
Ag 390 ( R ) : ~ 5 T - GAT^TCCTCGCAGGTCATCAT- 3* 1 (SEQ ID NO. 23) 
Ag. 390 (P) FAM 7 5 V -TCAAAGCCGTCAGCACAGGCACA-3 ' -TAMRA . (SEQ ID NO. 24) 

AMF-2 

20 The nucleotide sequence used for TaqMan analysis on AMF-2 is indicated in Table 13. 

The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. v 

Table 13. AMF-2 (20421338 ) Sequence Input for TaqMan Analysis 

(reverse strand of SEQ id no. .3) : . 

25 " GGAGGGCCTGTGATTCTACTGCAGGCAGGCACCCCCC 

CGAAGCOTGCTGCC^C<^T(^TCTGGTT.CCGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAG 
CACGGAATTGCTGAAGGATGGGAAGAGGGAGACCAC CGTGAGC CAACTGCTTATTAACC C CACG 

P^^CrxucAXuG^ CTGGATGTGCACCACCCTCCTACAGTGACCC 

30 GGGGC^GGAGGGTGAGCGTGTTCTCtTTACCTGCCAGGCCACAGCCAAC (SEQ 

ID NO. 25). - 

The following primer and probe sequences were used for TaqMan analysis of AMF-2. 

35 Ag 2 71 (F) : 5 * - ACCTGGACATAGGGCGTGTCT- 3 (SEQ ID NO. 26) 
Ag 271 (R) : 5 1 -TCGATGGAAGTCTCCTTGCC-3 ' (SEQ ID NO. 27) 

Ag 271 (P) : FAM-5 1 -CGAAGCATGAACGAAGCCATCCCTAG-3 1 -TAMRA (SEQ ID NO. 28) 
AMF-3 

40 The nucleotide sequence used for TaqMan analysis on AMF-3 is indicated in Table 14. 

The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. 



Table 14. AMF-3 (27251385) Sequence Input for TaqMan Analysis 

(reverse strand of SEQ ID NO. 5) : 
TCCAATCTCACATGCACGCA 

ACCCGTCATCCGGGCAAAGCCCCGGGCACAGGCAGGGTCGATCT 
CCGAGG (SEQ ID NO. 29) 
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The following primer and probe sequences were used for TaqMan analysis of AMF-3. 

Ag 72 v 
F CGGAAAGACCCAGCAGTGTT (SEQ ID NO. 30) 
5 R ATGATGTGAACGAGTGTGAGTCCTT (SEQ ID NO. 31) 

P Fam- CGCCCGTTGGGACAGACTCCC- Tamr a (SEQ ID NO. 32) 



AMF-4 

The nucleotide sequence used for TaqMan analysis on AMF-4 is indicated in Table 15. 
1 0 The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. — 

Table 1 5. AMF-4 (27486474) Sequence Input for TaqMan Analysis. 

TCACGGGAATAAGCCTGGGCCCGTCCCTTTGLAf^ 
- CGTGTACGGTGGCATCATCTCCC^ CT - 
15 GGACAGCTGCCAGGGGGACAGCGGGGGGCCCCTGGTGTGTCAAGAGAGGAGGCTGTGGAA^. 
■ > GTTAGTGGGAGCGACCAGCTTTGGCATCGGCTG 
^ CACGGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGAC.CTAAAAACC; , 
TGAAGAGGAAGGGGATAAGTAGCCACCTGAGTTCCTGAGK3^ 
. ; iv ,. . TCCCCTGGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTGAGTTC 
20 CGGGACCAGTAGCAGGCCC (SEQ ID NO. 33) 

The following primer and probe sequences were used for TaqMan analysis of AMP- 4 . 

Ag 248 (F) : 5 1 - TTTCCAACAAGATCTGCAACCA- 3 1 (SEQ" ID NO. 34) 
25 Ag,248 (R) 5 » r- AGGTAGCCCGCGCAGAGr 3 ' (SEQ ID NO- 35) 

Ag 248 (P) : FAM-5 ' -CGTGTACGGTGGCATCATCTCCCC-3 1 -TAMRA (SEQ ID MO . 36) 



AMF-5 " • " - 

The nucleotide sequence used for TaqMan analysis on AMF-5 is indicated in 
30 Table 16. The oligonucleotide sequences used as primers are boxed and the oligonucleotide 
sequence used as a probe is underlined. 

Table 16, AMF-5 (29691387) Sequence Input for TaqMan Analysis 

TGTCATTGTC CTTTTAC CTATTATATTTTTT CATACTCTGTGAAAACAAAT CAGTTG C CGG ACTIAAC CATGACCTATGATGG AA 
ATAATCCAGTGACATCTCATAGAGATGTG CCACTTT CTTATTGC [AA(JrCAGAOTG CAATTGl'GA'i'GAASpTCAGTGGGAACCAG 
3 5 TCTGTGGGAAC^qGGMTAAlTIV^^^ 

TTTATAACTGTAGTTGTGTGGAAGTAACTGGTCTCCAGAACAGAAATTACT 

CTTGTACAAGGAAATTTTTCATCTATGTTGCAATTCAAGTCATAA (SEQ ID 

NO. 37) 



40 The following primer and probe sequences were used for TaqMan analysis of AMF-5. 

Ag 287 (F) : 5 ' - AACTCAGACTGCAATTGTGATGAAA- 3 ' (SEQ ID NO. 38) 

Ag 287 (R) : 5 1 - CTAGACAAGGTGACAGGTAAGTTATTCC- 3 ' (SEQ ID NO. 39) 

Ag 287 (P) : TET-5 1 -TTGTTCCCACAGACTGGTTCCCACTGT-3 * - TAMRA {SEQ ID NO. 40) 
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AMF-6 

The nucleotide sequence used for TaqMan analysis on AMF-6 is indicated in Table 17. 
The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. 

5 Table 17. AMF-6 (38905521) Sequence Input for TaqMan Analysis 

TGGCAGCCC TGGAGGAGCCGATGGTGGACC TGGACGGCGAGCTGCCTTTCGTGCGGCCCCTGCCCCACATTGCCGT 
G CTCCAGGACp A^TUCCG 

GCAGGAACGGAGGGACCTGCCTCCTGGGCCT 
10 TGAGATTTGTCCTCCGGA ( SEQ ID NOl '41) ' ^ ~ipC~ ^ ...... 

The following primer and probe sequences were used for TaqMan analysis of AMF-6. 

Ag 252 (F) .: 5 * - GAGCTGGCGCAACTGTTCC- 3 1 ..(3EQ ID: NO.. 42) : 
15 Ag 252 (R) : 5 1 - GACAAACTTCTCTGTGAGCGTGTG- 3 ' (SEQ ID NO. 43) 

Ag 252 (P) : TET-5 ' -CGCAACTCTGCCTCTTCCTCATCGG-3 1 -TAMRA (SEQ ID NO. 44 J 

AMF-7 • \. 

The nucleotide sequence used for TaqMan analysis on AMF-7 is indicated in Table 18. 
20 The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined: 

Table 18. AMF-7 (4194093) Sequence Input for TaqMan Analysis 

(reverse strand of SEQ ID NO. 13) : 

c 9 cctt catgctgccggcgggctgctcgcg^ 
25 9er^cgcctgcgcacagcgacaattgcaattg : gagcaga^cct:gcgcgt:tt:gccgtcggct 
gctgcatgcctgggaaccaactgggacccgggctttgaagccacctc£agggccagaaac 
taatggagaggacccccttccagcatgcacacccagtccacaagacctcaaagagttgga 
gtttctgacccaggcactggagaaggctgtacgagttcgaagaggcatcactaaggccga 
a 9agagagacaaggcccccagcctgaaatctaggtccattgtcacctcttctggcacgac 
J 0 a 9cctccgccccaccgcattccccaggccaagctggtggccatgcttcagacacgagacc 
caccaagggcctccgccagaccacggtgcctgccaagggccaccctgagcgccggctgct 
gtcagtgggggatgggacccgtgttgggatgggagcccgaacccccaggcctggggcggg 
cctcagggaccagcaaatggccccatccgctgctcctcaggccccagaagccttcacact: 
caaggagaaggggcacctgctgcggctgcctgc ggcat:t:caggaaagca"g5^g cccagaa 
35 ctcp;acrcct atqqacccaactcacrtl:ccacacaq]accagtgattccacggatgc| cgccgc 
tgccaaaacccagttcctccagaacatgcagacagcttcaggcgggccccagcccaggct 
cagtgctgtggaggtggaggcggaggcggggcgcctgcggaaggcctgctcgctgctgag 
actgcgcatgagggaggagctctcagcagcccccatggactggatgcaggagtaccgctg 
cctgctcacgctggaggggctgcaggccatggtgggccagtgtctgcacaggctgcagga 
gctgcgtgcagcggtggcggaacagccaccaagaccatgtcctgtggggaggccccccgg 
agcctcgccgtcctgtgggggtagagcggagcctgcatggagcccccagctgcttgtcta 
ctccagcacccaggagctgcagaccctggcggccctcaagctgcgagtggctgtgctgga 
ccagcagatccacttggaaaaggtcctgatggctgaactcctccccctggtaagcgctgc 
acagccgcaggggccgccctggctggccctgtgccgggctgtgcacagcctgctctgcga 
gggaggagcacgtgtccttaccatcctgcgggatgaacctgcagtctgagcct.ttcccat 
gctgccctcggc (SEQ ID NO. 45) 

The following primer and probe sequences were used for TaqMan analysis of AMF-7. 
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AblS <F) : 5' -GGCATTCAGGAAAGCAGCTT-3' (SEQ ID NO. 46) 

Abl6 (R) : 5 ' - GCATCCGTGGAATCACTGGT - 3 ' (SEQ ID NO. 47) 

AblS (P) : FAM - 5 * - TGGGC C CAG CTCAGTTCCACACA- TAMRA (SEQ ID NO. 48) 

5 AMF-8 

_ The nucleotide sequence used for TaqMan analysis on AMF-8 is indicated in 
Table 19. Hie oligonucleotide sequences used as primers are boxed and the oligonucleotide 
sequence used as a probe is underlined. - * 

Table 19. AMF-8 (AC01 1036_A) Sequence Input for TaqMan Analysis 

10 (reverse strand, of:, SBQ V II) NQ, ;: l£.).:.. , : uiC :;-.:;vi:-...,r,i:j., .. 

ATGGAGGCTCAACAGTACGAGCAGCAGCGTCGA/J^ 

CTGTGGATACTGCTGAAG<^GGGAAGAAA<^^ 

GTGGAGTGTGTGTGTGCCCACCAGTGGAGaCTGTGGG 

AAGCAAACCATGAAGACCCAGAGATGTAAGATCCCCTGCA^ 
1 5 AGTTCGflGGCCTGGGGAGAATGTGACCTGAACACAGCCCTGAAGACCAGAACT 

[CAATGGCGAAT^ CCAGAAGACTG TCACCATCTCCAAGCC^ 

„ CTAGAACTTAAAGTAAAAAA^U^AAAAAAAAAAAAAAAATTCT (SEQ ID NO, 4 9h_ . 



20 



The following primer and probe sequences, were used for TaqMan analysisipf AMF-8. 

Ag 177' (P) : 5 1 -CCCTGCACAATGCCGAAT--3 * (SEQ~ ib /NO . 50) 
Ag ; l^Sr *R> 3.r-TGAGGTTTGGGCTTGGTCAG-3 1 (SEQ ID NO. 52) 

Ag 177 (P) : TET-5 ' - CACCATCTCCAAGCCCTGTGGCAA- 3 1 -TAMRA (SEQ ID NO. 52) 

25 AMF-9 ,;:::r: ''- i A: '' Ji ----- ; : - — •-• ■ 

The nucleotide sequence used for TaqMan analysis on AMF-9 is indicated in Table 20. 
The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. 

Table 20. ANIF-9 (AL307658) Sequence Input for TaqMan Analysis 

3 0 TTTTrGAAGTTTTCACTCATAAATC 
TAGC^AAAATC^TCTTGAl^ 
[?m5GAATGGTGCCCAGCA| C AGAGCAAAGAGA 

AAGAGGCAGGAGGAAGAGGATGACAAGGATGAAGGTGGTGTAGATCTTCTGGTGCACAGGG 
AAGCAGCAGATGTGTTCCTTTTCATATAGGAAGTCATATTTGA^ 
35 CTACGATGACTGCCACCAGCCAGACCACACCTAGCATTGTGAAAGCCCTTCG (SEQ ID NO. 53) 

The following primer and probe sequences were used for TaqMan analysis of AMF-9. 

GPCR 13 (P) : 5 ' - ATGGAATGGTGCCCAGCA- 3 ' (SEQ ID NO. 54) 
GPCR 13 (R) : 5 ' - TGGAAGAAGAAACGAGCTGTCA- 3 ' (SEQ ID NO. 55) 
40 GPCR 13 (P) : 5' -CAGCAAAGAGAGCCACCACTGTCACCA-3 ' (SEQ ID NO. 56) 

AMF-10 

The nucleotide sequence used for TaqMan analysis on AMF-10 is indicated in Table 
21. The oligonucleotide sequences used as primers are boxed and the oligonucleotide sequence 
used as a probe is underlined. 
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Table 21. AMF-10 (G55707_A) Sequence Input for TaqMan Analysis 



The following primer and probe sequences were used for TaqMan analysis of AMF-10. 

Ag 191 (F) : 5 ' -GACTTACTCCATCGCTGAGAAC3CT-3 » (SEQ ID NO. 58) 
Ag 191 (R) 5 ' -GCTGGTGATCGTATTAGCCGA-3 » (SEQ ID. NO. 59) 

Ag 191 (P) : FAM-5 ' -CATCAATGCCAGCTTTTTCCAGTCTTCC- 3 ■ -TAMRA (SEQ ID NO. 60) 

Example 2. Quantitation of AMFX gene expression using TaqMan analysis. 

The quantitative expression patterns of clones AMF- 1 -10 were assessed in a large 
number of noffliai and tumor saniple cells arid cell lines rjy real time quantitative PGR 
(TaqMan®) performed 6ii a Perkin-Elmer Biosystems ABI PRISM® 1 7700 Sequence Detection 
System. Table 21 shows the expression patterns of AMF-1 , AMF-2, AMF-4, and'AMF-6. 




Table 21. AMF-X gene expression in cells and tissues. 



Normal & Tumor Tissues 



AFM-1 



AMF-2 



AMF-6 



AMF-4 



Relative Expression (%) 



Endothelial cells . 
Endothelial cells (treated) 
Pancreas 

Pancreatic ca. CAPAN 2- . 

Adipose 

Adrenal gland 

Thyroid 

Salivary gland 

Pituitary gland 

Brain (fetal) 

Brain (whole) 

Brain (amygdala) 

Brain (cerebellum) 

Brain (hippocampus) 

Brain (hypothalamus) 

Brain (substantia nigra) 

Brain (thalamus) 

Spinal cord 

CNS ca. (gtio/astro) US7-MG 
CNS ca. (glio/astro) U-l 1 8-MG : 
CNS ca. (astro) S Wl 783 
CNS ca.* (neuro; met ) SK-N-AS 
CNS ca. (astro) SF-539 
CNS ca. (astro) SNB-75 
CNS ca. (glio) SNB- 19 
CNSca. (glio)U251 



0.00 
0.00 
0.00 
0.00 
2.66 
0.00 
.7.3& 
5.87 
0.00 
100.00 

3;oo 

0.80 
1.44 
2.80 
5.63 
7.33 
2.01 
1.18 
0.00 
0.00 
0.00 
0.00 
0.00 
0.00 
2.68 
0.00 



4.97 
4.30 
3.06 
23.98 
39.78 
8.19 
6.08 - 
4.09 
- 10.22 
8.96 
3.74 " 
1.66 
10.51 
1.18 
3.42 
3.52 
2.70 
3.96 
23.98 
24.83 
17.08 
17.56 
27.36 
65.07 
53.59 
26.79 



17.31 
5.15 
13.03 
10.73 
62.85 
4.30 
6.56 
15.60 
2:29 
1.08 
0.12 
0.19 
6.75 
0.00 
1.07 
0.26 
0.46 
1.69 
0.00 
33.22 
37.37 
0.00 
3.54 
4.07 
0.00 
0.23 



0.00 

0.00 

14.66 

0.00 

0.00 

0.00 

11.27 

13.58 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

6.79 

0.01 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 
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CNS ca. (glio) SF-295 


0.00 


33.45 


15.71 


3.33 


Heart 


0.00 


4.54 


15.18 


0 00 


Skeletal muscle 


0.00 


1.91 


0.32 


0.00 


Bone marrow 


0.00 


1.73 


6.34 


0.00 


Thymus 


1.86 


18.95 


56.64 


0.00 


Spleen 


0.00 


5.08 


9.09 


0.29 


Lymph node 


0.00 


6.04 


32.09 


2 10 


Colon (ascending) 


0.81 


3.24 


0.21 


0.01 


Stomach 


0.00 


11.99 


18.82 


26.24 


Small intestine 


0.00 


8.66 


9.02 


2.84 


Colon ca. SW480r . 


- 0.00 


1.85 


0.61) 


0 00 


Colon ca.* (SW480 met)SW620 


0.18 


2.42 


0.00 


10.88 


Colon ca. HT29 


0.00 


1.75 


0.87 


0.00 


Colon ca;HCT-116; v 


-• 2.72 ... 


: , 10.37 . 


2.47 ;;- 


0.00 


Colon ca. CaCo-2 


21.92 


21.16 


3.93 


0 00 

v.vv 


Coldh ca.HCT-15 ! - ■ 


1.99 


4.97 


4.61 


9 67 


Colon ca. HCC-2998 


0.00 


1.15 


11.58 


0 00 

v.vv 


Gastric ca.* (liver met) NC1-N87 


91.38 


3.06 


85.86 


100 00 

I vv. vv 


Bladder; A iniva-: v.- 


0.00: - 


.15,93 :'' -■ 


29.32 


0 00 


Trachea 


0.00 


7.03 


32.09 


40 61 


Kidney """"^ ' 


7.59 


8.90 


8.66 


0 0? "£ 
V. \J4f 2 


Kidney (fetal) 


46.65 


55.86 


32.09 




Renal ca. 786-0 


0.00 


96.59 


28.13 


0.00 


Renal ca. A498 ■*'■ : 


6.00 


65.52 


40.90 


0 00 

V. W 


Renal ca. RXF 393 


0.00 


27.74 


18.82 


0 00 
v.vv 


Renal ca. ACHN 


0.00 


65.07 


5.79 


0 00 

v.vv 


Renal ca. UO-31 


0.00 


41.75 


17.31 


0.00 


Renal ca. TK- 10 


0.00 


56.64 


8.84 


0 00 
v.vv 


Liver 


0.13 


3.30 


11.99 


2 76 


Liver (fetal) 


0.05 


2.35 


2.32 


0.00 


Liver ca. (hepatoblast) HepG2 


14.66 


0.02 


0.00 


0.27 


Lung 


7.75 


8.02 


42.93 


0 04 


Lung (fetal) 


81.79 


1 1.91 


100.00 


0.01 


Lung ca. (small cell) LX-1 


1.61 


1.35 


11.34 


48.97 


Lung ca. (small cell) NCI-H69 


0.04 


4.15 


0.00 


0.00 


Lung ca. (s.cell var.) SHP-77 


0.32 


0.36 


0.00 


0.00 


Lung ca. (large celI)NCI-H460 


0.00 


26.98 


0.41 


0.00 


Lung ca. (non-sm. cell) A549 


0.13 


7.13 


0.78 


0.00 : 


Lung ca. (non-s.cell) NCI-H23 


0.00 


7.08 


2.38 


0.00 


Lung ca (non-s.cell) HOP-62 


0.00 


15.82 


1.30 


0.00 


Lung ca. (non-s.cl) NCI-H522 


1.31 


5.37 


15.28 


0.00 


Lung ca. (squam.) SW 900 


0.00 


17.08 


17.08 


0.00 


Lung ca. (squam.) NCI-H596 


0.02 


8.66 


0.00 


0.00 


Mammary gland 


0.23 


45.06 


55.10 


31.86 


Breast ca.* (pi. effusion) MCF-7 
Breast ca.* (plief) MDA-MB-231 


0.00 


0.00 


4.15 


8.30 


0.00 


15.07 


0.83 


0.00 


Breast ca.* (pi. effusion) T47D 


3.61 


5.33 


8.72 


57.83 


Breast ca. BT-549 


0.00 


65.07 


97.94 


0.00 


Breast ca. MDA-N 


0.00 


25.70 


0.00 


0.00 


Ovary 


0.28 


39.50 


14.97 


3.52 


Ovarian ca. OVCAR-3 


7.48 


32.31 


1.24 


0.21 


Ovarian ca. OVCAR-4 


8.78 


32.99 


1.03. 


6.93 


Ovarian ca. OVCAR-5 


0.00 


35.60 


36.10 


0.73 


Ovarian ca. OVCAR-8 


0.00 


20.03 


13.58 


1.04 
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Ovarian ca IOROV-1 


n c\a 


A1 OA 


U.OiS 


A AA 
0.00 


Ovarian ca * /acritec^ ^lf-OV-'? 


ft ftft 


ai <i 


S.O / 


A AA 

0.00 


TV/T vrviri A*h*i i itm 


i m 


11 AO 


i n ao 


0.16 


Uterus 


5.45 


y.y4 


1 O AO 


0.29 


Placenta 


ft ftft 


ZJ .oz 


4.y / 


0.05 


Prostate 


ft 

u.zy 


0. / J 


4o.y& 


0.65 




ft Art 


S / .OS 


7.56 


0.00 


Testis 


O.Z J 


ZJ.5Z 


i /Ay 


A AA 

0.00 


Melanoma Hs688(A).T 


0.00 




AA AA 


ft Oft 


Melanoma* (met) Hs688(B).T 


0.00 


25.35 


38.69 


0.00 


Melanoma UACC-62 


0.00 


23.00 


0.02 


0.00 


Melanoma M14 


0.00 


36.10 


1.13 


0.00 


Melanoma LOX IMVI 


0.00 


100.00 


0.01 


0.00 


Melanoma* (met) SK-MEL-5 ; 


0.00 . 


10.88 


0.10 


0.00 


Melanoma SK-MEL-28 


0.00 


79.00 


11.91 


0.00 


Melanoma UACC-257 


0.00 


0.00 


0.00 


0.00 




TM407F 


: TM418.F- 


TM371 


TM416F 



The quantitative expression patterns of clones AMF-1-10 were assessed in a large 
number of normal and tumor sample cells and cell lines by real time quantitative PCR 
(TaqMan^ performed on a Perkin-Elmer Biosystems ABI PRISM® 7700 Sequence Detection 
5 System. Table 22 shows the expression patterns of AMF-3, AMF-7, AMF-8, and AMF-10. 

Table 22. AMF-X gene expression in cells and tissues. 
Normal & Tumor Tissues AMF-10 AMF-8 AMF-3 AMF-7 



Relative Expression (%) 



Endothelial cells 


0.00 


0:58 


0.02 


0.39 


Endothelial cells (treated) 


0.00 


0.23 


0.09 


0.57 


Pancreas * 


0.08 


3.15 


0.17 


0.21 


Pancreatic ca. CAPAN 2 


0.00 


0.62 


0.10 


1.64 


Adipose 


0.47 


8.13 


2.47 


0.00 


Adrenal gland 


0.00 


2.47 


0.64 


0.51 


Thyroid 


0.00 


7.54 


1.31 


0.53 


Salivary gland 


0.00 


4.54 


1.69 


0.45 


Pituitary gland 


0.01 


19.75 . 


0.04 


0.08 


Brain (fetal) 


0.00 


20.03 


41.18 


3.35 


Brain (whole) 


0.00 


37.89 


0.01 


3.52 


Brain (amygdala) 


0.00 


20.45 


15.28 


0.96 


Brain (cerebellum) 


0.00 


100.00 


100.00 


1.92 


Brain (hippocampus) 


0.00 


22.53 


28.52 


6.61 


Brain (hypothalamus) 


0.00 


76.31 


4.24 


1.28 


Brain (substantia nigra) 


0.00 


30.57 


22.69 


1.67 


Brain (thalamus) 


0.00 


29.32 


9.21 


2.43 


Spinal cord 


0.00 


35.11 


1.76 


0.59 


CNS ca. (glio/astro) U87-MG 


0.00 


8.66 


0.01 


1.49 


CNS ca. (glio/astro) U-l 18-MG 


100.00 


2.18 


0.01 


3.52 


CNS ca. (astro) SW1783 


4.15 


1.61 


0.00 


1.16 


CNS ca.* (neuro; met ) SK-N-AS 


0.00 


38.42 


0.95 


9.41 


CNS ca. (astro) SF-539 


0.00 


3.61 


0.00 


1.12 


CNS ca. (astro) SNB-75 


0.00 


23.98 


0.00 


1.45 



123 



BNSDOCIO: <WO__01748S7A2J_> 



WO 01/74897 



PCT/US01/10892 



CNS ca. (glio) SNB-19 


0.00 


33.68 


0.48 


1.03 


CNS ca. (glio) U251 


0.18 


9.41 


0.12 


0.88 


CNS ca. (glio) SF-295 


0.00 


11.83 


0.00 


0.41 


Heart 


0.00 


11.27 


0.36 


0.25 


Skeletal muscle 


0.00 


0.54 


0.48 


0.11 


Bone marrow 


0.00 


1.88 


0.06 


1.35 


Thymus 


0.00 


6.84 


0.66 


3.77 


Spleen 


0.00 


8.25 


0.12 


0.42 


Lymph node 


0.00 


2.78 


0.11 


0.50 


Colon (ascending) 


0.00 


2.90 


2.12 


0.23 


Stomach 


0.00 


9.02 


1.23 


**0.39 


Small intestine 


0.00 


8.30 


0.42 


1.73 


Colon ca. SW480 


0.00 


0.32 


0.02 


1.60 


Colon ca.* (SW480 met)SW620 


0.00 


0.52 


0.18 


3.59 


Colon ca. HT29 


0.00 


0.49 


0.05 


2.98 


Colon ca.HCT-116 


0.00 


1.15 


3.26 


58.64 


Colon ca. CaCo-2 


0.00 


5.40 


2.21 


4.77 


Colon ca. HCT-15 


0.00 


1.39 


0.32 


2.74 


Colon ca. HCC-2998 


0.00 


0.93 . 


0.15 


J .7U 


Gastric ca,* (liver met) NCI-N87 


0.00 


1.27-1 


9.61 


2 94 


Bladder 


0.13 


5.79 


1.50 




Trachea 


0.00 


8.54 


0.77 


1 91 


Kidney 


0.00 


5.11 


1.10 


0 20 


Kidney (fetal) 


0.00 


22.69 


5.11 


3 IS 


Renal ca. 786-0 


0.00 


1.10 


0.01 


2.54 


Renal ca. A498 


0.00 


1.30 


0.00 


2 19 


Renal ca. RXF 393 


0.00 


1.04 


0.00 


0.60 


Renal ca. ACHN 


0.00 


0.44 


0.00 


1.33 


Renal ca. LfO-31 


0.00 


0.85 


0.04 


0.56 


Renal ca. TK- 10 


0.00 


1.17 


0.12 


2.94 


Liver 


0.00 


2.76 


0.14 


2.78 


Liver (fetal) 


0.00 


2.24 


0.22 


3.52 


Liver ca. (hepatoblast) HepG2 


0.00 


1.29 


0.71 \ 


1.70 


Lung 


0.00 


1.41 


0.56 ' 


0.01 


Lung (fetal) 


0.00 


11.27 


16.27 


1.92 


Lung ca. (small cell) LX- 1 


0.00 


0.83 


0.32 


3.24 


Lung ca. (small cell) NCI-H69 


0.00 


8.84 


1.51 


5.48 


Lung ca. (s.cell var.) SHP-77 


0.00 


1.88 


6.98 


100.00 


Lung ca. (large cell)NCI-H460 


0.00 


1.39 


43.53 


6.93 


Lung ca. (non-sm. Cell) A549 


0.00 


1.41 


0.05 


0.84 


Lung ca. (non-s.cell) NCI-H23 


0.00 


1.10 


0.84 


2.21 


Lung ca (non-s.cell) HOP-62 


0.00 


1.24 


0.09 


0.23 


Lung ca. (non-s.cl) NCI-H522 


0.00 


2.35 


0.40 


15.39 


Lung ca. (squam.) SW 900 


0.00 


1.51 


0.78 


3.37. 


Lung ca. (squam.) NCI-H596 


0.00 


4,09 


\2\ 


7.80 


Mammary gland 


0.00 


17.31 


1.18 


0.43 


Breast ca.* (pi. effusion) MCF-7 


0.00 


1.87 


0.08 


6.75 


Breast ca.* (pl.ef) MDA-MB-231 
Breast ca.* (pi. effusion) T47D 


0.00 


0.76 


, 0.00 


1.71 


0.00 


0.98 


0.94 


1.47 


Breast ca. BT-549 


0.00 


2.74 


0.19 


1 8.30 


Breast ca. MDA-N 


0.00 


4.61 


0.17 


13.68 


Ovary 


0.00 


3.00 


0.63 


0.68 


Ovarian ca. OVCAR-3 


0.00 


0.61 


1.57 


1.63 


Ovarian ca. OVCAR-4 


0.00 


1.00 


0.80 


1.17 
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Ovarian ca. OVCAR-5 


0 00 


0 7<> 


0 4<v 


A Q*7 
H.y i 


Ovarian ca. OVCAR-S 


0 00 


0 SO 


0 14. 




Ovarian ca. IGROV- 1 


0 00 


0 SO 

w.Jv 


n no 

U.V7 


i i a 
I.1U 


Ovarian ca.* (ascites) SK-OV-3 


0 Cft 


0 i\X 


A 1ft 
U. 1 U 




My om etrium 


0 00 


1 1 4ft 


1 
1 


A A*7 


Uterus 


0 00 

v.UV 




J .JO 


a /t>f 


Placenta 


^ S9 




ft 77 
U.J / 


z.iy 


Prostate 


0 00 

\J .\J\J 




1 1 A 
1 . 1 O 


a An 


Prostate ca * ( bone metYPP-'? 


0 00 


l.ol 


1 A Q 


1 O AC 

lo.U!> 


Testis 


0.36 






9 1 7 A 
Z I . /O 


Melanoma Hs688(A).T 


0.00 


1.62 


0 00 
v. w 


ft 3** 


Melanoma* (met) Hs688(B).T 


0.20 


0.94 


0.08 


0.04 


Melanoma UACC-62 


0.00 


0.54 


0.00 


0.12 


Melanoma M14 


0.00 


1.94 


0.56 


1.25 


Melanoma LOX 1MVI 


0.00 


2.12 


0.10 


33.68 


Melanoma* (met) SK-MEL-5 


0.00 


0.96 


0.16 


2.21 


Melanoma SK-MEL-28 


0.00 


1.81 


0.01 


4.04 


Melanoma UACC-257 




0.00 


■-. 0.00 


9.02 




TM361F 


TM415T 


TM 208 F 


TM 221 F 



TaqMan expression analysis was also performed on AMF-5 and AMF-9. 

EQUIVALENTS 

Although particular embodiments have been disclosed herein in detail, this has been 
done by way of example for purposes of illustration only, and is not intended to be limiting 
with respect to the scope of the appended claims which follow. In particular, it is contemplated 
by the inventors that various substitutions' alterations, and modifications may be made to the 
invention without departing from the spirit and scope of the invention as defined by the claims. 
The choice of nucleic acid starting material, clone of interest, or library type is believed to be a 
matter of routine for a person of ordinary skill in the art with knowledge of the embodiments 
described herein. Other aspects, advantages, and modifications considered to be within the 
scope of the following claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the 
consisting of: 

a mature form of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20; 

a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said mature 
form, provided that said variant differs in no more than 15% of the amino acid residues 
from the amino acid sequence of said mature form; 

an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 1 6, 1 8 and 20; and 

a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20, wherein one or more amino acid residues in 
said variant differs from the amino acid sequence of said mature form, provided that 
said variant differs in no more than 15% of amino acid residues from said amino acid 
sequence. 

2 The polypeptide of claim 1, wherein said polypeptide comprises the amino acid 
sequence of a naturally-occurring allelic variant of an amino acid sequence selected from the 
group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino 
acid sequence that is the translation of a nucleic acid sequence differing by a single nucleotide 
from a nucleic acid sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 7, 9, 
11, 13, 15, 17 and 19. 

4. The polypeptide of claim 1, wherein the amino acid sequence of said variant 
comprises a conservative amino acid substitution. 
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5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding 
a polypeptide comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 and 20, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said mature 
form, provided that said variant differs in no more than 1 5% of the amino acid residues 
from the amino acid sequence of said mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18 and 20; " 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, i2, 14, 16, 1 8 and 20, wherein one or more amino acid residues in 
said variant differs from the amino acid sequence of said mature form, provided that 
said variant differs in no more than 1 5% of amino acid residues from said amino acid 
sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide comprising an 
amino acid sequence chosen from the group consisting of SEQ ID NOS:2, 4, 6, 8, 1 0, 
12, 14, 16, 1 8 and 20, or a variant of said polypeptide, wherein one or more amino acid 

; residues .in said variant differs from the amino acid sequence of said mature form, 
provided that said variant differs in no more than 15% of amino acid residues from said 
amino acid sequence; and 

(f) a nucleic acid molecule comprising the complement of (a), (b), (c), (d) or (e). 

6. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises the nucleotide sequence of a naturally-occurring allelic nucleic acid variant. 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
encodes a polypeptide comprising the amino acid sequence of a naturally-occurring 
polypeptide variant 

8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs 
by a single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17 and 19. 
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9. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule 
comprises a nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1 , 3, 
5,7,9,11,13,15, 17 and 19; 

(b) a nucleotide sequence differing by one or more nucleotides from a nucleotide 
sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17and 
19, provided that no more than 20% of the nucleotides differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b). 

1 0. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule 
hybridizes under stringent conditions to a nucleotide sequence chosen from the group 
consistingof SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17and 19, or a complement of said - 
nucleotide sequence. 

1 1 . The nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence comprising a coding sequence differing by one or more 
nucleotide sequences from a coding sequence encoding said amino acid sequence, 
provided that no more than 20% of the nucleotides in the coding sequence in said first 
nucleotide sequence differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first polynucleotide; and 

(c) a nucleic acid fragment of (a) or (b). 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13. The vector of claim 12, further comprising a promoter operably-linked to said 
nucleic acid molecule. 

14. A cell comprising the vector of claim 12. 

1 5. An antibody that binds immunospecifically to the polypeptide of claim 1 . 

128 

BNSDOCID:<WO 0174897A2_t_> 



WO 01/74897 



PCT/US01/10892 



16. The antibody of claim 15, wherein said antibody is a monoclonal antibody. 

17. The antibody of claim 15, wherein the antibody is a humanized antibody. 

18. A method for determining the presence or amount of the polypeptide of claim 1 
in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that bi£ds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody boxmd to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

19. A method for detennining the presence or amount of the nucleic acid molecule 
of claim 5 in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the. sample with a probe that binds to said nucleic acid molecule; and 

(c) detennining the presence or amount of the probe bound to said nucleic acid molecule, 
thereby determining the presence or amount of the nucleic acid molecule in said sample. 

•t 

20. The method of claim 1 9 wherein presence or amount of the nucleic acid 
molecule is used as a marker for cell or tissue type. \ 

2 1 . The method of claim 20 wherein the cell or tissue type is cancerous. 

22. A method of identifying an agent that binds to a polypeptide of claim 1 , the 
method comprising; 

(a) contacting said polypeptide with said agent; and 

(b) detennining whether said agent binds to said polypeptide. 

23. The method of claim 22 wherein, the agent is a cellular receptor or a 
downstream effector. 

24. A method for identifying an agent that modulates the expression or activity of 
the polypeptide of claim 1, the method comprising: 
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(a) providing a cell expressing said polypeptide; 

(b) contacting the cell with said agent, and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent 
modulates expression or activity of said polypeptide. 

25. A method for modulating the activity of the polypeptide of claim 1 , the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a compound 
that binds to said polypeptide in an amount sufficient to modulate the activity of the 
polypeptide. 

26. A method of treating or preventing a AMFX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired the 
polypeptide of claim 1 in an amount sufficient to treat or prevent said AMFX-associated 
disorder in said subject. 

27. The method of claim 26 wherein the disorder is selected from the group 
consisting of disorders related to cell signed processing, cell adhesion or migration pathway 
modulation, chemoresistance, radiotherapy resistance, survival in trophic factor limited r: . 
secondary tissue site microenvironments, connective tissue disorders, tissue remodeling, 
oncogenesis, cancer of the breast, ovaiy, cervix, prostate, endometrium, stomach, colon, lung, 
bladder, kidney, brain, and soft-tissue, cellular transformation, developmental tissue 
remodeling, inflammation, blood clot formation and resorption, hematopoiesis, angiogenesis, 
multidrug resistance related to organic anion transporters, malignant disease progression, 
autocrine and paracrine regulation of cell growth, and cellular responses to external stimuli, 
and other diseases, disorders and conditions of the like. 

28. The method of claim 26 wherein the disorder is related to cell signal processing 
and metabolic pathway modulation. 

29. The method of claim 26, wherein said subject is a human. 
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30. A method of treating or preventing a AMFX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired the 
nucleic acid of claim 5 in an amount sufficient to treat or prevent said AMFX-associated 
disorder in said subject. 



3 1 . The method of claim 30 wherein the disorder is selected from the group 
consisting of consisting of disorders related to cell signal processing, cell adhesion or 
migration pathway modulation, chemoresistance, radiotherapy resistance, survival in trophic 
factor limited secondary tissue site microenvironments, connective tissue disorders, tissue 
remodeling, oncogenesis, cancer of the breast, ovary, cervix," prostate, endometrium, stomach, 
colon, lung, bladder, kidney, brain, and soft-tissue, cellular transformation, developmental 
tissue remodeling, inflammation, blood clot formation and resorption, hematopoiesis, 
angiogenesis, multidrug resistance related to organic anion transporters, malignant disease 
progression, autocrine and paracrine regulation of cell growth, and cellular responses to 
external stimuli, and other diseases, disorders and conditions of the like. 

32. The method of claim 30 wherein the disorder is related to ceil signal processing 
and metabolic pathway modulation. 

33. Themethodof claim 30, wherein said subject is a human. 

34. A method of treating or preventing a AMFX-associated disorder" said method 
comprising administering to a subject in which such treatment or prevention is desired the 
antibody of claim 15 in an amount sufficient to treat or prevent said AMFX-associated disorder 
in said subject 

35. The method of claim 34 wherein the disorder is selected from the group 
consisting of disorders related to cell signal processing, cell adhesion or migration pathway 
modulation, chemoresistance, radiotherapy resistance, survival in trophic factor limited 
secondary tissue site microenviroiunents, connective tissue disorders, tissue remodeling, 
oncogenesis, cancer of the breast, ovary, cervix, prostate, endometrium, stomach, colon, lung, 
bladder, kidney, brain, and soft-tissue, cellular transformation, developmental tissue 
remodeling, inflammation, blood clot formation and resorption, hematopoiesis, angiogenesis, 
multidrug resistance related to organic anion transporters, malignant disease progression, 
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autocrine and paracrine regulation of cell growth, and cellular responses to external stimuli, 
and other diseases, disorders and conditions of the like. 

36. The method of claim 34 wherein the disorder is related to cell signal processing 
and metabolic pathway modulation. 

37. The method of claim 34, wherein the subject is a human. 

38. A pharmaceutical composition comprising the polypeptide of claim 1 and a 
pharmaceutically-acceptable carrier. 

39. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 
and a pharmaceutically-acceptable carrier. 

40. A pharmaceutical composition comprising the antibody of claim 1 5 and a 
pharmaceutically-acceptable carrier. 

41 . A kit comprising in one or more containers, the pharmaceutical composition of 
claim 38. 

42. A kit comprising in one or more containers, the pharmaceutical composition of 
claim 39. 

43. A kit comprising in one or more containers, the pharmaceutical composition of 
claim 40. 



44. A method for determining the presence of or predisposition to a disease 
associated with altered levels of the polypeptide of claim 1 in a first mammalian subject, the 
method comprising: 

(a) measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to the amount of the 
polypeptide present in a control sample from a second mammalian subject known not 
to have, or not to be predisposed to, said disease; 
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wherein an alteration in the expression level of the polypeptide in the first subject as 
compared to the control sample indicates the presence of or predisposition to said disease. 

45. The method of claim 44 wherein the predisposition is to cancers. 

46. A method for determining the presence of or predisposition to a disease 
associated with altered levels of the nucleic acid molecule of claim 5 in a first mammalian 
subject, the method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first mammalian subject; 
and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to the amount of the 
nucleic acid present in a control sample from a second mammalian subject known not 
to have or not be predisposed to, the disease; 

wherein an alteration in the level of the nucleic acid in the first subject as compared to 
the control sample indicates the presence of or predisposition to the disease. 

47. The method of claim 46 wherein the predisposition is to cancers. 

48. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to alleviate the 
pathological state, wherein the polypeptide is a polypeptide having an amino acid sequence at 
least 95% identical to a polypeptide comprising an amino acid sequence of at least one of SEQ 
IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8 and 20, or a biologically active fragment thereof. 

. 49. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal the antibody of claim 15 in an amount sufficient to alleviate the 
pathological state. 
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